JOURNAL OF CONSULTING PSYCHOLOGY

American Psychological Association


Edward S. Bordin, Editor
University of Michigan

Boyd McCandless, Associate Editor
University of Iowa

Advisory Editors: John W. Atkinson, University of Michigan. Frank X. B[...],
University of California, Berkeley. John M. Butler, University of Chicago.
Allen T. Dittmann, National Institute of Mental Health. Seymour Fisher,
Baylor University. Harrison G. Gough, University of California, Berkeley.
William L. [...] of Michigan. Robert B. Malmo, McGill University.
[...] E. Meehl, University of [...]. [...] R. Miller, University of Michigan.
Alan K. Rosenwald, University of Illinois. Saul [...], Washington University.
Julian B. Rotter, Ohio State University. Nevitt Sanford, University of
California, Berkeley. [...], Yale University. Robert R. Sears, [...] University.
Edwin S. Shneidman, Veterans Administration, Los Angeles, California.
Read D. Tudde[...], University of California, [...].

The Journal of Consulting Psychology is the clinical journal of the
American Psychological Association. It is devoted primarily to the
publication of original research [...] relevant to psychological diagnosis,
psychotherapy and counseling, personality, and the dynamics of behavior.
Case studies, theoretical contributions, descriptions of clinical
techniques, and discussions of the professional problems of clinical
psychology also appear in its pages.

Manuscripts submitted for regular or early publication and manuscripts
for Brief Reports should be sent to the Editor: Edward S. Bordin,
Department of Psychology, University of Michigan, Ann Arbor, [...].
All manuscripts should be submitted in duplicate to facilitate [...]

Early Publication. Authors may secure the early publication of an
acceptable article in the first issue [...]

American Documentation Institute. To save printing costs, materials
supplementary to articles may be deposited with the ADI [...] obtain
copies, order by Document Number as cited in the reference footnote,
from ADI Auxiliary Publications Project, Photoduplication Service,
Library of Congress, Washington 25, D. C. The indicated fee must be
remitted in advance. Make checks payable to [...] Photoduplication
Service, Library of Congress.

Reprints. Authors receive fifty reprints gratis. Additional quantities
[...] ordered when returning proofs.

Helen Or[...]
[...], Promotion Manager

Communications concerning subscriptions, change of address, claims for
the nonreceipt of [...] and other business matters, should be sent to the
American Psychological Association, 1333 Sixteenth St. N.W.,
Washington 6, D. C. Address changes must reach [...] office by the 10th
of the month to take effect the following month. Undelivered [...]
address changes will not be replaced; subscribers should notify the post
[...] guarantee second-class forwarding postage. Other claims for
undelivered [...] within four months of publication.

Journal of Consulting Psychology [is] published bimonthly in February,
[...] June, August, October, [...]. The subscription per year is $10.00,
[...] the American Psychological Association, Inc., [...] and Lemon Sts.,
Lancaster, Pa., [and] 1333 Sixteenth St. N.W., Washington, D. C.
Second-class postage paid at Lancaster, Pa.
© 1961 by the American Psychological Association, Inc.














April 1961 Vol 25, No. 2 


Contents 


ARTICLES

Clinical Perception of the Therapeutic Transaction: Bertram R. Forer, Norman L. Farberow, Herman Feifel, Mortimer M. Meyer, Vita S. Sommers, and Ruth S. Tolman
Authoritarianism in the Therapeutic Relationship: John L. Vogel
Social Desirability and Response to Perceived Situational Demands: David Marlowe and Douglas P. Crowne
Attitudes toward Sex Roles and Feelings of Adequacy in Homosexual Males: Brenda A. Dickey
A Second Validation of a Long-Term Rorschach Prognostic Index for Schizophrenic Patients: Zygmunt A. Piotrowski and Barry Bricklin
Some Effects of Stimulus Variation on Spiral Aftereffect in Organic and Nonorganic Subjects: Ronald M. Sindberg
Intellectual Functioning in a Group of Normal Octogenarians: Andrew S. Dibner and James F. Cummins
Prediction of Relapse for Psychiatric Patients: David J. Gouws
The Representation of Physique in Children’s Figure Drawings: A. B. Silverstein and H. A. Robinson
Independence Training and First Graders’ Achievement: June Elizabeth Chance
Magical Thinking and Associated Psychological Reactions to Blindness: Stanley P. Zarlock
Speech Disturbance and Judged Anxiety: Donald S. Boomer and D. Wells Goodrich
Normal, Hypnotically Induced, and Feigned Anxiety as Reflected in and Detected by the MMPI: Albert A. Branca and Edward E. Podolnick
The Effect of Brain Damage upon Speed, Accuracy, and Improvement in Visual Motor Functioning: Kenneth B. Stein
The Influence of Context on the Depression Scale of the MMPI in a Psychotic Population: Gordon W. Olson
Suppressing Distortion in Temperament Inventories: Jack Hand and Herbert H. Reynolds


BRIEF REPORTS

Sedatives and Suggestibility in Neurotic Patients: J. G. Ingham and J. M. White
Semantic Differential Rating of Self and of Self-Reported Personal Characteristics: James E. Madden
Field Dependence, Manifest Anxiety, and Sociometric Status in Children: Ira Iscoe and Joyce Ann Carden





Brief Reports 


The Journal of Consulting Psychology will 
accept Brief Reports of research studies in 
clinical psychology for early publication with- 
out expense to the author. The procedure is 
intended to permit the publication of soundly 
designed studies of specialized interest or lim- 
ited importance which cannot now be ac- 
cepted because of lack of space. Several pages 
in each issue will be devoted to Brief Reports, 
published in the order of their receipt with- 
out respect to the dates of receipt of the regu- 
lar articles. Most Brief Reports appear in the 
first or second issue to go to press following 
their final acceptance. 


An author who wishes to submit a Brief 
Report: 


1. Sends the Brief Report, limited to one printed 
page and prepared according to the specification 
given below. 

2. Also sends to the Editor a full report of the re- 
search study, in sufficient detail to give a clear ac- 
count of its background, procedure, results, and con- 
clusions, which will be filed with the American 
Documentation Institute to insure indefinite avail- 
ability. 

3. Prepares at least 100 mimeographed copies of 
the full report, which the author will send without 
charge to all who request it as long as the supply 
lasts. 


4. Agrees not to submit the full report to another 
journal of general circulation. 


Specifications 


Brief Report. The Brief Report should give 
a clear, condensed summary of the procedure 
of the study and as full an account of the re- 
sults as space permits. 

To insure that the Brief Report will be no
longer than one printed page, its typescript,
including all matter except the title and the
author’s lines, must not exceed 85 lines
averaging 42 characters and spaces in length.
Set the typewriter margins for short lines of
42 characters, which are 3.5 inches long in
elite typing, and 4.2 inches long in pica.
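The length rule above can be checked mechanically. The following is a minimal sketch, assuming the typescript is available as a list of text lines with the title and author lines already removed; the function name and the pitch constants (12 characters per inch for elite, 10 for pica) are illustrative and are not part of the Journal's instructions.

```python
# Hypothetical helper: checks a typescript against the 85-line,
# 42-character-average limit stated in the specifications.
MAX_LINES = 85
MAX_AVERAGE_CHARS = 42

# Assumed typewriter pitches (characters per inch).
ELITE_PITCH = 12
PICA_PITCH = 10


def fits_brief_report(lines):
    """Return True if the body stays within 85 lines averaging 42 characters."""
    body = [line.rstrip("\n") for line in lines]
    if not body:
        return True
    average = sum(len(line) for line in body) / len(body)
    return len(body) <= MAX_LINES and average <= MAX_AVERAGE_CHARS


# Line widths implied by a 42-character line at each pitch.
elite_inches = MAX_AVERAGE_CHARS / ELITE_PITCH   # 3.5
pica_inches = MAX_AVERAGE_CHARS / PICA_PITCH     # 4.2

print(fits_brief_report(["x" * 42] * 85))  # True
print(fits_brief_report(["x" * 42] * 86))  # False
```

The two width constants reproduce the 3.5-inch and 4.2-inch margins quoted in the text, which is a quick way to confirm the pitch assumption.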

The manuscript of the Brief Report must 
be double spaced throughout. Except for its 
short lines, it follows the standard style of 
the 1957 revision of the APA Publication 
Manual. Headings, tables, and references are 
avoided or, if essential, must be counted in 
the 85 lines. Each Brief Report must be ac- 
companied by a footnote in the style below, 
which is typed on a separate sheet and not 
counted in the 85-line quota: 


¹ An extended report of this study may be obtained
without charge from John Doe (giving the author’s
full name and address) or for a fee from the
American Documentation Institute. Order Document
No. ____ from ADI Auxiliary Publications Project,
Photoduplication Service, Library of Congress,
Washington 25, D. C., remitting in advance
$____ for microfilm or $____ for photocopies. Make
checks payable to: Chief, Photoduplication Service,
Library of Congress.


Extended report. Because the extended re- 
port is intended for photoduplication, and is 
not copy to be sent to a printer, its style 
should differ in several ways from that of 
other manuscripts: (a) The extended report 
should be typed with single spacing for 
economy in duplication. (b) Tables and fig- 
ures should be placed adjacent to the text 
which refers to them. A caption should be 
typed below each figure. (c) Footnotes should 
be typed at the bottom of the page on which 
reference is made to them. In other respects, 
the full report is prepared in the style speci- 
fied by the Publication Manual. 





Journal of Consulting Psychology 
1961, Vol. 25, No. 2, 93-101 


CLINICAL PERCEPTION OF THE THERAPEUTIC 
TRANSACTION 
BERTRAM R. FORER, NORMAN L. FARBEROW, HERMAN FEIFEL, 
MORTIMER M. MEYER, VITA S. SOMMERS, AND RUTH S. TOLMAN


Veterans Administration Outpatient Clinic, Los Angeles ¹


An important facet of the diagnostic and 
therapeutic work of the clinician is that the 
data upon which he makes his decisions be 
both relevant and determinate. In all clinical 
fields there are data of such universality as to 
be of essentially no differential significance 
(Forer, 1959). Diagnosis in internal medicine 
is not furthered by such facts as the existence 
of a given organ. Similarly clinical psychol- 
ogists are minimally assisted in their decisions 
by knowledge that their patients are human 
beings, have problems, possess unfulfilled 
needs, and the like. Clinical practice presup- 
poses recognition and evaluation of character- 
istics that vary sufficiently among clients to 
permit the description of uniqueness. 

But this is not enough. It is of additional 
importance that characteristics in which 
people do vary be relatively stable for a given 
individual. If, for one client, a particular psy- 
chological characteristic were to manifest a 
temporal variation approaching that of a 
large sample of individuals at any one time, 
clinical description would be largely a matter 
of chance. The same would be true of the 
individual described by a group of observers 
whose ratings covered the total population 
range on a given trait. 

In an attempt to clarify some of the lan- 
guage of clinical observation and description 
and to experiment with suitable research 
methods for further work, the writers sought 
to determine in concrete clinical situations: 
how well a group of trained observers would 
agree on what they perceived, what kinds of 
clinical material could be perceived most reli- 
ably, and whether clarification of descriptive 
language and concepts would enhance agree- 
ment among judges. 


¹ Formerly part of Veterans Administration Regional
Office, Los Angeles.


METHOD 


To this end a check-list was developed to include 
a sample of items which might be manifested in a 
therapeutic hour. Items were selected which would 
likely characterize some, but not all, clinical inter- 
actions. All items were cast in a form that permitted 
a rating of “present” or “not present.” The number 
of items changed as the experiment progressed as 
indicated in Table 1. Items were classified a priori 
by the judges as either “observational” or “infer- 
ential” according to the degree of extrapolation 
beyond immediate data believed necessary to make 
the clinical judgment. 

Judges were six diplomates in clinical psychology, 
five of whom had worked together in the same clinic 
for 8 to 11 years, and one who had been with the 
group for about 3 years and whose clinical training 
and background were similar in content and dura- 
tion. All were joint participants in training, super- 
vision, diagnosis, therapy, seminars, and research in 
a psychoanalytically oriented Veterans Administra- 
tion Mental Hygiene Service. 

The judges observed 50-minute psychotherapeutic 
sessions between a patient and a therapist through 
a one-way screen with a _ microphone-amplifier 
hookup. Judges made independent ratings without 
discussion. Observations consisted of four phases: 


A1-3. A male psychology trainee with a neurotic
woman patient. They had been working in the
observation room for some time, but did not know
of the group’s observation. There were three weekly
sessions followed by a discussion and revision of
some items.

B1-3. A female psychiatrist with a schizophrenic
male patient. They had been working together for
some time in another room and the therapist moved
into the observation room at the experimenter’s
request. There were three sessions followed by
discussion and revision of items.

B4-6. The same patient-therapist team. There were
three more weekly sessions followed by discussion
and more extensive revision and redefinition of
items (Table 1).

C1-3. A male psychology trainee with a neurotic
male patient whom he had been seeing in the room
for some time for supervisory purposes. They knew
nothing of the experiment.





TABLE 1

CHECK LIST FOR CLINICAL OBSERVATION

1. Therapist was primarily active. (Active: verbal activity.)ᵃ
2. Therapist was primarily comfortable. (Comfortable: has sense of ease with the patient. Frame of reference should be our concept of comfort for all therapists.)

Therapist’s major method was:
3. Reflective
4. Interpretive
5. Supportive

6. The session seemed to be focused on a problem.
(6a. Therapist attempted to focus on a particular problem, either content [...])
(6b. Patient tended to ramble from topic to topic.)
7. There were silences. (Silence: period when no one is talking, 30 seconds or more.)
8. If there were silences, they were usually broken by:
   a. the patient
   b. the therapist
9. The hour was characterized by resistance.
(9a. Patient expressed verbal disagreement with therapist’s interpretations, regardless of unconscious factors.)
(9b. Patient interrupted therapist more than once.)
(9c. Patient was halting in his speech: made pauses before words; incomplete sentences, more than once.)
(9d. Patient spoke mostly in generalities.)
(9e. Patient used technical psychological terms, more than once.)
10. Patient spoke in a monotone.
11. Patient was fidgety or restless.

Problem areas worked on were:
12. Authority
13. Sex
14. Dependency
15. Work
16. Hostility
17. Emotional control
18. Symptoms
19. Relationships with peopleᵇ

20. Patient used gestures. (More than once.)
21. Patient brought up a lot of material. (Material: variety or elaboration of content.)
22. The material brought up was deep. (Deep: (a) patient meaningfully relates something that is happening in present to what has happened in past [content], or (b) produces something with great affect.)
23. Patient was experiencing affect.

If experiencing affect, it was:
24. Anger
25. Fear
26. Sadness
27. Anxiety
28. Warmth

29. Patient seems to be rigid. (This applies to fluidity and spontaneity during the hour, whether material is censored or uncensored, etc. Not a judgment of character structure.)
30. Patient seems to show ability to form close relationships.
31. Patient seems capable of insight.
32. Patient seems self-critical.ᵇ
33. (Patient’s relationship to the therapist during the hour seemed to be one of positive feelings or rapport. This refers to conscious feelings.)
34a. The nature of the transference is predominantly positive.
34b. The nature of the transference is predominantly negative.

Patient’s major defenses shown were:
35. Projection (Attribution to other person of motives unacceptable to oneself)
36. Repression (Rationalization: logical excuse or justification of feelings or behavior)
37. Avoidance (Denial: avowed nonperception of reality situation, internal or external)
38. Intellectualization (Explanation of one’s feelings or behavior in terms of general or theoretical principles or abstract concepts)
39. Isolation (Separation of feeling and idea)
40. Reaction formation (Turning into the opposite)
41. Conversion (Displacement: shift of feeling from one object or person to another)

42. Patient will complete therapy.
43. Patient will need long-term therapy, 1½ years or more.
44. Diagnostic impression.

ᵃ Items in parentheses are clarifications or substitute items created during [...] the last patient.
ᵇ Items 19 and 32 added after first [...]


Each of the four sections, then, consisted of three
weekly observations of the same patient and
therapist. After each triad of observations the data
were examined and definitions were clarified with
the goal of establishing a clearer basis for judgment.
Between B4-6 and C1-3 most items were defined as a
result of consultation with three psychoanalysts.
New definitions were mimeographed on the rating
sheets. Some of the less clear items were replaced
by more nearly observational items.

The degree of interrater agreement was described
in terms of the binomial expansion in which
p = q = .5 for each item (present or absent), and n
is the number of raters, generally six, but
occasionally five. There is reason to question the .5
probability value for many clinical data since such
data are apt to occur in a clinical sample with varying
frequencies and clinicians are likely to have
differential sets to perceive them. It seemed most
reasonable, however, to treat the items as pennies, in
the absence of other information.² Agreement among
raters, then, was expressed at first in terms of p
values obtained from the binomial expansion. We
also wished to know whether our judges agreed more
on the observational than on the inferential items.
Hence compounding of probabilities was necessary.
To compound the probabilities over several series,
p values were converted into chi square according
to Pearson’s transformation: $\chi^2 = -2 \log_e p$, as
described by Jones and Fiske (195[...]).³


² An empirical investigation of p and q was made
by computing the proportion of present responses
for each item over the 12 replications and setting up
a distribution. The median value of .48 suggests that
for the sample of items the assumption that p = q
= .5 is a reasonable one.

³ The issue of independence of the tests is of
importance here. Most of the items had been planned
to be as probably independent as possible.
Measurement of independence in this study is
impossible because any item about which there is
complete agreement will necessarily correlate
perfectly, positively or negatively, with every other
unanimous item. Whether the replications on the same
patient can be considered independent events is still
an open question.
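The binomial step can be illustrated with a short computation. This is a minimal sketch, assuming that the p value for an item is the one-tailed probability that at least k of n raters fall on the majority side when p = q = .5; the article does not spell out the tail convention, so that reading and the function name are assumptions. It does reproduce the unanimity values of .016 and .031 quoted in the Results.

```python
from math import comb


def agreement_p(k_majority, n_raters):
    """One-tailed binomial probability (p = q = .5) that k or more of n
    raters give the same rating. The tail convention is an assumption."""
    if not 0 <= k_majority <= n_raters:
        raise ValueError("k_majority must lie between 0 and n_raters")
    tail = sum(comb(n_raters, i) for i in range(k_majority, n_raters + 1))
    return tail / 2 ** n_raters


print(round(agreement_p(6, 6), 3))  # 0.016, six raters unanimous
print(round(agreement_p(5, 5), 3))  # 0.031, five raters unanimous
print(round(agreement_p(5, 6), 3))  # 0.109, five of six agree
```

Under this reading a single dissenting rater already moves an item out of the conventional .05 region, which is why only unanimity counts as significant for panels of five or six.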


Each chi square that enters into the compounding
carries 2 df. Hence, the compound probability for a
given item for three replications is the sum of the
three chi square values and df = 2 × 3 = 6. In
similar fashion the compound probabilities of the
amount of interrater agreement on the observational
and inferential items were computed by totaling the
chi square values for all observational and inferential
items separately with df equal to twice the number
of items.
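The compounding rule just described can be sketched in a few lines. The example below is illustrative only: the function names are hypothetical, and the upper-tail probability uses the closed form that holds for a chi square with an even number of degrees of freedom, which is always the case here because each p value contributes 2 df.

```python
from math import exp, log


def chi_square_from_p(p):
    """Pearson's transformation: chi square = -2 log_e p, carrying 2 df."""
    return -2.0 * log(p)


def chi_square_upper_tail(x, df):
    """Upper-tail probability of chi square for an even df (closed form)."""
    if df <= 0 or df % 2:
        raise ValueError("df must be a positive even integer")
    half = x / 2.0
    term = 1.0
    total = 1.0
    for i in range(1, df // 2):
        term *= half / i
        total += term
    return exp(-half) * total


def compound(p_values):
    """Sum the transformed p values; df is twice the number of p values."""
    chi = sum(chi_square_from_p(p) for p in p_values)
    df = 2 * len(p_values)
    return chi, df, chi_square_upper_tail(chi, df)


# Three replications with unanimous agreement among six raters (p = 1/64 each).
chi, df, p = compound([1 / 64, 1 / 64, 1 / 64])
print(round(chi, 2), df, p < 0.001)  # 24.95 6 True
```

A useful sanity check on the method is that a single p value passes through unchanged: with one test the compounded probability equals the original p.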


The important questions were: do the judges agree
more or less in their ratings of observational and
inferential items, and do judges agree more in
successive periods of observation as a function of
experience and redefinition of terms? Statistically
these questions reduce to the significance of the
difference between total amounts of agreement. Since
our measures of agreement are expressed in terms
of chi square, the statistical test is of the difference
between two chi squares. To our knowledge, the
only possible way of testing this difference is by
means of the F ratio. The F ratio is defined (Peters
& Van Voorhis, 1940, p. 420) as the ratio of two
independent chi squares, each divided by its own df:

$$F = \frac{\chi^2_1 / df_1}{\chi^2_2 / df_2}$$

In this case $\chi^2_1$ is the summation of the chi squares
representing the amounts of agreement on all items
in one treatment (e.g., observational items) and
$df_1$ is the degree of freedom (e.g., twice the number
of observational items). $\chi^2_2$ and $df_2$ are the
corresponding values for the other treatment (e.g.,
inferential items).

RESULTS

To attain a statistically significant degree
of rater agreement in a given replication of a
single item all raters must give the same
judgment (for the values of n in this study).
Unanimity yields a p value of .016 for six
raters and .031 for five raters. In only one
of the 12 replications did more than half of
the observational items attain this degree of
rater agreement. The same is true of the
inferential items. The proportion of observational
items that showed significant agreement
varied from 31.3% to 54.5% among
the 12 observational periods. For the inferential
items the range is from 28.0% to
54.2% (Table 2). It is patent that significant
agreement was not the rule. On two items
only was agreement unanimous throughout
the 12 replications: absence of reaction
formation (Item 40), and need for long-term
therapy (Item 43).


³ (continued) The nature of our findings, however,
suggests that they were nearly independent.


TABLE 2

AMOUNT OF AGREEMENT AMONG JUDGES FOR EACH ITEM
AND OBSERVATIONAL PERIOD

[Columns: Item; Patient A; Patient B; Patient C; Total. Rows are grouped into Observational and Inferential items. The cell entries are not recoverable from this copy.]

Note. All entries are expressed in probability values.
ᵃ Item deleted at this point.
ᵇ Item added after this point.
ᶜ Item replaced at this point.

Most, but not all, of the items were
significantly in agreement when the probabilities
were compounded over the 12 replications.
Even so, most items varied enormously in
degree of agreement from replication to
replication; hence overall significance of
agreement gives little ground for confidence at any
one time or for any one case in clinical
observation. Description of the vicissitudes of
a few items may be informative. The presence
of monotonous speech (Item 10) was significant
in only 5 of the 12 replications,
restlessness (Item 11) in 7 replications. Some content
items were rarely significant. The problem
area, hostility, a clinical favorite, was not
once agreed upon unanimously. Presence of
gestures (Item 20) was significantly agreed
upon once in the series.

Among the inferential items depth of
material (Item 22) was significant twice, before
the clarifying definition. Presence of anxiety
showed perfect agreement three times, early
in the series; capacity for insight showed
perfect agreement consistently for the first
patient’s three replications, and for the last
patient’s as well, and not at all for the second
patient’s six replications. Judgments about
positive transference were never in complete
accord; negative transference ratings were in
accord for seven replications (all by
absence). Agreements in regard to ego defenses
were as follows: reaction formation, perfect
score, always absent; projection, four
significant agreements split between two
patients; denial, intellectualization, and
isolation, each three times; rationalization,
never significantly agreed upon.


While psychological defenses, it may be 
argued, are rather subtle and may become 
apparent only in intensive, therapeutically 
oriented observation, the lack of agreement 
in six successive observation periods with the
same patient seems cause for some concern. 
The judges agreed only once in nine replica- 
tions on the item: The hour was characterized 
by resistance. This item was replaced for pa- 
tient C’s observations by Items 9a through 9e 
which were deemed to represent some of the 
observational components of resistance. Dur- 
ing the three observations of the last patient 
9c was significant all three times, 9a and 9d 
twice, 9c once, and 9b not at all. If this 
finding can be generalized, it suggests that 
agreement about some clinical observations 
can be improved by specifying concrete 
behaviors. 

Results of the comparison between ob- 
servational and inferential items were un- 
expected. First of all, neither observational 
nor inferential items showed a significant 
preponderance of items with unanimous
agreement as tested by a four-fold chi square 
test. When the combined probabilities of 
observational items were tested against those 
of the inferential items, not a single F ratio 
reached a .05 level of significance for the 12 
replications individually, for any of the 
patient-therapist combinations, or for the 
12 replications. Within the limitations of this 
experiment, then, there was no difference in 
rater agreement as a function of the degree 
of a priori objectivity of the items (Table 3). 

There was, similarly, no significant
difference in overall agreement (observational and
inferential items combined) between
patient-therapist combinations. That is, variations in
the persons observed had no systematic effect
upon the degree of agreement among the
raters. Possible interactions with particular
items may exist but are difficult to prove.
And, finally and somewhat sadly, there was
no improvement in degree of agreement as a
result of practice, communication of criteria
for rating each item, or specific definition of
items. Some items improved and some
deteriorated. In fact, the highest summated chi
square for blocks of inferential or observational
items or combinations of the two is
not significantly different from the lowest
value.


TABLE 3

SUMMATED χ² VALUES AND VARIANCES OF INTERJUDGE AGREEMENT IN RATINGS
OF OBSERVATIONAL AND INFERENTIAL ITEMS

Observation
Period         Observational χ²    Inferential χ²    Total χ²

A1                   67.40             124.94          192.34
A2                   62.81             115.95          178.76
A3                   80.06             138.53          218.59
A1-3                210.26             379.42          589.69
B1                   92.65             110.64          203.30
B2                   82.33             107.34          189.68
B3                   65.84              93.78          159.63
B1-3                240.83             311.77          552.60
B4                   77.50             114.22          191.72
B5                   62.25             104.77          167.02
B6                   79.07             107.40          186.47
B4-6                218.82             326.39          545.20
B1-6                459.64             638.16        1,097.80
C1                  125.82             109.52          235.34
C2                  126.25             121.79          248.04
C3                  103.28             119.82          223.10
C1-3                355.35             351.13          706.49
Total             1,025.26              [...]           [...]

[The df and variance columns of Table 3 are not recoverable from this copy.]


When probabilities are compounded over a
series of replications, it is possible for much
variation in agreement to occur among
replications and still attain the .01 or .001 level
of significance. To get the flavor of this
variation it might be worthwhile to examine
the behavior of one item. Item 31, capable
of insight, was unanimously agreed upon for
the three replications of Patient A; for
Patient B the agreement was 5/6, 3/6, 4/5,
5/6, 3/5, and 5/6 (none of them significant);
for Patient C all three replications were in
total agreement. The compounded p value is
beyond .001. Agreement was perfect for two
patients and clearly in the direction of
agreement, though not significantly, for the third
patient. It may be argued that the nature
of the patient is an important consideration.
Perhaps so. Evaluation of insight possibilities
in psychotic patients such as B may be less
certain than in neurotics and the role of
insight in improvement may be thought to
differ as well. On Item 42 only the first three
replications yielded significant agreement, yet
p reached .01.
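The compounding for Item 31 can be retraced as a rough check. The arithmetic below rests on two assumptions that the text does not state: that each p value is the one-tailed binomial probability of the observed majority or better, and that six raters were present at all sessions with Patients A and C. Under those assumptions the per-session values of $-2 \log_e p$ are 8.32 for 6/6, 4.43 for 5/6, 0.84 for 3/6, 3.35 for 4/5, and 1.39 for 3/5, so that

$$\chi^2 = 6(8.32) + 3(4.43) + 0.84 + 3.35 + 1.39 \approx 68.8, \qquad df = 2 \times 12 = 24.$$

The tabled value of chi square for 24 df at the .001 level is 51.18, so a total near 68.8 agrees with the statement that the compounded p value is beyond .001, even though none of the six Patient B sessions was significant on its own.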

In order to obtain some estimate of
interrater agreement in quantitative terms, the
judgments on all items for each judge were
set up in four-fold tables singly with each
other judge. Phi coefficients were computed
for each pair of judges on each of the last
three observation periods.

Phi coefficients ranged from .27 to .70 with
median phi’s for the three successive periods
of .49, .45, and .40 in that order. Even
though the phi coefficients are lower than
Pearson’s r’s would be, their values are higher
than those of Gelfand, Quarrington,
Widemann, and Brown (1954) on rating scales of
Rorschach traits and Lisansky’s (1956) phi
coefficients for questionnaire items derived
from the Rorschach. They also exceed the
intercorrelations obtained by Stern, Stein,
and Bloom (1956, p. 113) from Q sorts on
the basis of school records, projective data,
and behavioral observations. While 38 of our
45 phi’s are significant at or beyond the .01
level, the magnitudes are still distressingly
low for purposes of prediction.
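The phi coefficient for a pair of judges follows directly from their four-fold table. The sketch below is illustrative: the ratings are invented, the function name is hypothetical, and 1 stands for "present" and 0 for "not present".

```python
from itertools import combinations
from math import sqrt


def phi_coefficient(ratings_x, ratings_y):
    """Phi for two judges' present (1) / not present (0) ratings."""
    if len(ratings_x) != len(ratings_y):
        raise ValueError("both judges must rate the same items")
    pairs = list(zip(ratings_x, ratings_y))
    a = sum(1 for x, y in pairs if x == 1 and y == 1)  # both present
    b = sum(1 for x, y in pairs if x == 1 and y == 0)  # only first judge
    c = sum(1 for x, y in pairs if x == 0 and y == 1)  # only second judge
    d = sum(1 for x, y in pairs if x == 0 and y == 0)  # both absent
    denominator = sqrt((a + b) * (c + d) * (a + c) * (b + d))
    if denominator == 0:
        raise ValueError("phi is undefined when a margin of the table is zero")
    return (a * d - b * c) / denominator


# Invented ratings on eight items, for illustration only.
judge_x = [1, 1, 0, 0, 1, 0, 1, 1]
judge_y = [1, 0, 0, 0, 1, 0, 1, 1]
print(round(phi_coefficient(judge_x, judge_y), 3))  # 0.775

# Six judges give 15 distinct pairs; over three periods that is 45 phi's.
judge_pairs = list(combinations(range(6), 2))
print(len(judge_pairs) * 3)  # 45
```

The pair count matches the 45 coefficients reported for the last three observation periods.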

One judge was consistently highest in his 
median phi’s with other judges. There was no 
consistency as to who correlated lowest with 
the others. 

Of the items which were retained
throughout the investigation, only two observational
items (16 and 20) and one inferential item
(34a) failed to be agreed upon beyond chance
expectations. This finding can be interpreted
in contradictory ways. Since the judges
agreed beyond chance on most of the items
over the whole experiment, they were
evidently perceiving something in the way of
communality. On the other hand, they were
less consistently in agreement than seems
desirable and their agreement fluctuated in
no predictable fashion.

One can become lost in trivia and post hoc
rationalizations in examining the behavior of
specific items. Since a number of items
showed unanimity in the judgments of the
raters for one or more patients and not for
others, it might be suspected that some
patients present more clear-cut evidence about
some clinical variables than other patients do
and that patients differ in the kinds of clinical
data for which they show evidence. To be
sure, patients vary somewhat from hour to
hour and it may be expected that their
observers and therapists do also. Our data can
be interpreted in whichever direction the
reader’s bias lies. It can be argued that the
variations in amount of agreement from
interview to interview render the clinical data
practically useless for a given clinical
situation. On the other hand, the fairly high level
of agreement over the 12 replications
indicates that there is significant communality of
clinical perception.

It should be remembered that the
maximum number of raters was six and that the
divergent opinion of one rater makes quite
a difference in the amount of agreement. A
larger number of raters would lessen the effect
of a single rater.


DISCUSSION 
A rough generalization that can be made
from these findings is that for most of the
items the judges agreed in the combined
data beyond chance expectations, but that
the degree of agreement varied from item to
item, patient to patient, and replication to
replication in no predictable fashion as others
have found (Forer, Farberow, Meyer, &
Tolman, 1952; Gelfand et al., 1954) in their
study of Rorschach ratings. Such
unsystematic variation in agreement does not
necessarily mean that clinical observation is too
subjective to be of practical significance. It
does mean, however, that some of the
parameters of clinical observation and inference
could, perhaps, be profitably re-examined and
reformulated.

We might ask ourselves whether the
present experiment is a fair test of the clinical
interview. Was the situation real enough; did
it tap variables that are ordinarily involved
in therapists’ observations? Would a therapist
ask himself such questions during a therapy
session, or are these judgments generally the
result of summarizing observations gleaned
from long-term contact with the patient?
Yet, many of these variables are assessed at
the end of single initial interviews with
reference to diagnostic or therapeutic goals.

Studies of the accuracy of clinical judgment 
and prediction have yielded little evidence of 
general superiority attributable to profes- 
sional training (Cline, 1955; Grigg, 1958; 
King, Ehrmann, & Johnson, 1952; Lisansky, 
1956; Luft, 1950, 1951). Our judges’ senior 
status suggests a fairly high level of profes- 
sional competence, but it does not imply that 
further development of skills or radical 
changes in orientation are unlikely to occur, 
even though we would ordinarily assume 
that they had reached an asymptote in their 
perception of most of the variables used in 
this experiment. Evidence is not impressive 
that homogeneity of training necessarily con- 
duces to homogeneity of judgment. Is it, 
then, a fact that for certain kinds of clinical 
judgment the variance attributable to indi- 
vidual differences among judges exceeds that 
attributable to professional training? It seems 
so in this study. We are forced to agree with 
Bendig (1956) that individual differences 
among judges may outweigh many facets of 
the rating process and the data to be judged. 

Discussion of terms after each series of
three observation periods had no measurable
effect on degree of agreement. Lisansky
(1956) believes that improvement can occur 
in the rating of Rorschach protocols, despite 
her and our (Forer et al., 1952) empirical 
findings to the contrary. Wiener’s (1958) 
belief that “We can and must train ourselves 
to agree on the judgments we make from 
projective test protocols” seems more a wish 
than a likely prospect. 

There is a possibility that amount of rater 
agreement is inversely related to the amount 
of data which the clinician must process. The 
task of sorting a large mass of heterogeneous 
data may create interference with the evalua- 
tion of any one of the classification variables. 
Evidence suggests that there may have been 
too much rather than too little information, 
possibly of a contradictory nature, and too 
many ratings competing for the observer’s 
attention (Borke & Fiske, 1957; Cutler, 
Bordin, Williams, & Rigler, 1958; Gage, 
1953; Giedt, 1955; King et al., 1952; 
Kostlan, 1954; Luft, 1950, 1951). 

Perhaps clinicians need to take stock of 
what they are asking from themselves, to ap- 
praise realistically rather than hopefully 
what is possible, so that they need not be 
unduly apologetic nor defensively nihilistic 
toward research evidence that questions their 
prowess. 

It seems unlikely from this and other 
studies that any conceptual system or any 
amount of training can engender the degree 
of conformity or reproducibility in clinical 
perception and judgment that is achieved by 
standardized tests or electronic computers. 
Would such a state of affairs be desirable? 
The growing supplementation and frequent 
replacement of objective tests by projective 
methods in diagnostic work suggests that ob- 
jective methods leave something to be desired. 
The price of objectivity is limitation of in- 
formation, and the clinician feels the need 
for more and different kinds of information 
than that provided by objective tests, even 
though the information be of a lower order 
of reliability. Factually he deals with pa- 
tients’ verbalizations which are also of a low 
order of reliability and it is through his 
inferences that the clinician constructs a 
relatively stable conceptual model of his 
patients. 


Forer, Farberow, Feifel, Meyer, Sommers, and Tolman 


Complete unanimity of clinical judgment 
would represent constriction of the range of 
cues and of clinical attention, hence of thera- 
peutic activity. Zero variance among thera- 
pists would likely generate low variance in 
therapeutic activity. But therapists differ 
inevitably as persons, in their preferred 
theoretical systems, in their ability to use 
particular techniques, and in their apparent 
effectiveness in dealing with different kinds 
of patients. The all-around therapist who 
works equally well with all kinds of patients 
is as much a myth as the psychological test 
that measures every aspect of the psyche. 

It may be that the less than perfect agree- 
ment in the clinical observations described 
above reflects those individual differences 
among therapists that enable them to special- 
ize, learn from one another, grow continually 
in their skills, and discover new concepts and 
techniques. 


SUMMARY 


As a means of investigating the reliability 
of psychologists’ perception of clinical data, 
six diplomates in clinical psychology observed 
three patient-therapist teams for a total of 
12 weekly psychotherapy sessions. Independ- 
ent ratings of present or absent were made on 
a check-list containing a number of presum- 
ably observational and inferential items. 
After each series of three sessions the 
clinicians discussed, redefined, and replaced 
items with the goal of increasing interjudge 
agreement. 


1. On very few items was there consist- 
ently significant agreement among the judges. 

2. The amount of agreement on most items 
varied from session to session and patient to 
patient in no detectable pattern. 

3. While the amount of agreement compounded
over the 12 sessions was significantly
beyond chance expectations for most items,
it was not sufficiently substantial to warrant
confidence in the judges’ observations.


4. The amount of agreement among judges
was not affected by the apparent objectivity
of the item. Even more precisely operationally-defined
items such as silence of 30 seconds
or more were not consistently agreed
upon.





Clinical Perception of Therapy 101 


5. There is no evidence that practice in
judging, increased contacts with a particular
patient, or discussion and clarification of
items enhance objectivity.


REFERENCES 

Bendig, A. W. The personality of judges and their
agreement with experts in judging clinical case
histories. J. consult. Psychol., 1956, 20, 422.

Borke, Helene, & Fiske, D. W. Factors influencing
the prediction of behavior from a diagnostic
interview. J. consult. Psychol., 1957, 21, 78-80.

Cline, V. B. Ability to judge personality assessed
with a stress interview and sound film technique.
J. abnorm. soc. Psychol., 1955, 50, 183-187.

Cutler, R. L., Bordin, E. S., Williams, Joan, &
Rigler, D. Psychoanalysts as expert observers of
the therapy process. J. consult. Psychol., 1958, 22,
335-340.

Forer, B. R. Psychological test reports: Universal or
discriminating. J. nerv. ment. Disease, 1959, 129,
83-86.

Forer, B. R., Farberow, N. L., Meyer, M. M., &
Tolman, Ruth S. Consistency and agreement in
the judgment of Rorschach signs. J. proj. Tech.,
1952, 16, 346-351.

Gage, N. L. Explorations in the understanding of
others. Educ. psychol. Measmt., 1953, 13, 14-26.

Gelfand, L., Quarrington, B., Wideman, H., &
Brown, J. Interjudge agreement on traits rated
from the Rorschach. J. consult. Psychol., 1954, 18,
471.

Giedt, F. H. Comparison of visual, content, and
auditory cues in interviewing. J. consult. Psychol.,
1955, 19, 407-416.

Grigg, A. E. Experience of clinicians, and speech
characteristics and statements of clients as variables
in clinical judgment. J. consult. Psychol.,
1958, 22, 315-319.

Jones, L. V., & Fiske, D. W. Models for testing the
significance of combined results. Psychol. Bull.,
1953, 50, 375-382.

King, G. F., Ehrmann, J. C., & Johnson, D. M.
Experimental analysis of the reliability of observations
of social behavior. J. soc. Psychol.,
1952, 35, 151-160.

Kostlan, A. A method for the empirical study of
psychodiagnosis. J. consult. Psychol., 1954, 18,
83-88.

Lisansky, Edith S. The inter-examiner reliability of
the Rorschach test. J. proj. Tech., 1956, 20, 310-317.

Luft, J. Implicit hypotheses and clinical predictions.
J. abnorm. soc. Psychol., 1950, 45, 256-259.

Luft, J. Differences in prediction based on hearing
versus reading clinical interviews. J. consult.
Psychol., 1951, 15, 115-119.

Peters, C. C., & Van Voorhis, W. R. Statistical
procedures and their mathematical bases. New
York: McGraw-Hill, 1940.

Stern, G. G., Stein, M. I., & Bloom, B. S. Methods
in personality assessment. Chicago: Free Press,
1956.

Wiener, M. The problem of interjudge agreement
and prediction. J. proj. Tech., 1958, 22, 122-123.


(Received April 11, 1960) 





Journal of Consulting Psychology 


1961, Vol. 25, No. 2, 102-108 


AUTHORITARIANISM IN THE THERAPEUTIC
RELATIONSHIP¹


JOHN L. VOGEL


University of Washington


This study was concerned with the therapist,
the patient, and their relationship in
psychotherapy. It dealt with authoritarianism
as a personality trait in each of the individuals,
and tested for associations between
authoritarianism as a trait, attitude, and behavior.
A major hypothesis of this study was
that the peculiar interaction of authoritarianism
in therapist and patient would be crucial
to the development of the therapeutic relationship.
Although this study did concern itself
with authoritarianism, this trait was not
necessarily thought to be the most basic or
critical aspect of the therapeutic relationship.
It was selected for study here to demonstrate
the importance of considering the personality,
needs, and motives of therapist and patient,
as they interact in the therapeutic relationship.

Of the many personality variables that
might be studied in this manner, the writer
chose to consider authoritarianism, as delineated
in the major work on The Authoritarian
Personality (Adorno, Frenkel-Brunswik, Levinson,
& Sanford, 1950). It was thought that
the patient population of any clinic might
not be as individualistic, equalitarian, and
self-actualizing as some writers seemed to assume.
Further, even a generally equalitarian
patient may develop authoritarian expectations
about psychotherapy from his experience
with other professions. For the therapist,
we know that there is a wide range of therapeutic
behavior in terms of training and orientation,
to say nothing of the range of attitudes
and needs they may have. Authoritarianism,
then, was thought to be one of the
trait dimensions relevant to patient and therapist
roles.


¹ Based on a doctoral dissertation submitted to the
University of Chicago, 1959. The writer is indebted
to Donald W. Fiske, Desmond S. Cartwright, and
Ralph W. Heine for their encouragement and help.


A major issue which still surrounds authori- 
tarianism as measured by the F Scale refers 
to the question of its behavioral implications 
and correlates. Titus and Hollander (1957) 
raise serious question about the relationship 
between authoritarian attitudes and behavior. 
They urge special caution where interper- 
sonal behavioral implications are to be drawn. 
Christie, on the other hand, makes a strong 
case for congruency of F Scale scores and 
predicted behavior, citing four studies in sup- 
port of his position (Christie & Jahoda, 1954, 
p. 145). As a test of the question, this study 
hypothesized that authoritarianism, as a trait 
of therapists and patients, would find expres- 
sion in their attitudes toward psychotherapy 
and their behavior in therapy. 


The second basic hypothesis of this study
follows from the argument that a similarity
of personality traits in patient and therapist
tends to facilitate the relationship. Barron’s
thesis (1950) seems to be the first study to
consider both patient and therapist variables
in an experimental approach to the therapeutic
relationship. Axelrod (1951) argued
and found partial support for the hypothesis
that progress in therapy was more likely when
the personalities of patients and therapists
were similar than when they were dissimilar.
Underlying this hypothesis was the theory
that


the presence of an emotional identification or empathy
between patient and therapist, springing from
common emotional experience and manifested more
or less by a similarity of personalities, is a condition
favorable for the successful development of the
therapeutic process (pp. 4


Studies by Bown (1954), Hiler (1958),
Libo (1957), and Ashby, Ford, Guerney, and
Guerney (1957) are pertinent considerations
of this question. Although the evidence is
something less than substantial, there does
seem to be a line of thought suggesting that
there is an interaction between the personality
traits of therapist and patient, and that
generally a similarity of traits tends to facilitate
the relationship. This position receives
some support from studies in the fields of
leadership and education (Goldberg & Stern,
1952; Haythorn, Haefner, Langham, Couch,
& Carter, 1956; Jones, 1954; Sanford, 1950).
The second basic hypothesis of this study
states that a similarity of therapist and patient
along the trait dimension of
authoritarianism-equalitarianism is related to the
establishment of successful or good therapeutic
relationships.


Sanford (1956) raises some question about
whether authoritarian patients, without reference
to therapist traits, may not have real
difficulty forming therapeutic relations with
any therapist. He is rather blunt on this
point, writing:


The person high on F rarely seeks, but rather resists
the idea of psychotherapy; and once a start has been
made, the technical problems are trying (p. 313).


Sanford then goes on to note a study by
Freeman and Sweet (1954) in which they
offered evidence that patients with many features
of the F pattern actually respond better
in certain forms of group therapy than they
do in individual therapy. This argument, obviously,
refers to patient traits only. As a
more parsimonious explanation of therapeutic
failure the question merits testing and forms
a specific hypothesis of this study.


METHOD 

Instruments 
The California F Scale as a measure of authoritarianism
was taken directly from Forms 45 and 4
as published by Adorno et al. (1950). One item: “It
is best to use some prewar authorities in Germany
to keep order and prevent chaos,” was omitted as
untimely and probably ambiguous to most subjects.
Scores were derived in the conventional manner.
Responses from -3 to +3 were converted to positive
scores ranging from 1 to 7, with no response scored
as 4. The sum of scores for the 29 items was used
for tests of hypotheses in this study.
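
The scoring rule just described can be sketched in a few lines. This is only an illustration: the function names are hypothetical, and the sketch assumes the conventional response format in which subjects mark -3, -2, -1, +1, +2, or +3, with no neutral option.

```python
def f_scale_item_score(response):
    """Convert one response (-3..-1, +1..+3, or None) to a 1-7 score."""
    if response is None:
        return 4  # no response is scored at the midpoint
    if response not in (-3, -2, -1, 1, 2, 3):
        raise ValueError("response must be -3..-1, +1..+3, or None")
    # Adding 4 maps -3..-1 onto 1..3 and +1..+3 onto 5..7.
    return response + 4

def f_scale_total(responses):
    """Sum of converted item scores (29 items in this study)."""
    return sum(f_scale_item_score(r) for r in responses)

# Hypothetical protocol: mixed agreement with two omitted items.
example = [3, -2, 1, None, -3, 2, None] + [1] * 22
print(len(example), f_scale_total(example))
```

With 29 items, totals under this rule can range from 29 to 203.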

The Authoritarian-Equalitarian Therapy sort (designated
as AET) was developed especially for this
study. A 40-item card sort was constructed containing
20 items reliably prejudged as descriptive of an
authoritarian therapy relationship, and 20 items similarly
prejudged as descriptive of an equalitarian
therapy relationship. By verbal instruction the subjects
were asked to sort the 40 items in 8 piles of
5 items each. The piles were numbered from 1 to 8,
pile Number 1 designated as “Least True or False,”
pile Number 8 as “Most True.” Patients were asked
to sort the 40 items to indicate “which of these
things you would like to have be most true and
which of these things you would like to have be
least true, or even false, about the relationship between
you and your doctor (therapist, counselor).”
Therapists were given essentially similar instructions
with added emphasis on the expression of “own
opinions” rather than what they had been taught or
had read. Each item was scored according to the pile
number in which it was placed. The scores for the
20 authoritarian items were summed for each subject
and designated the AET score, with a possible
range from 50 to 130. For each patient-therapist pair
an AET Discrepancy Score was computed by summing
the squares of the score differences over the 40
items. This Discrepancy Score is, of course, a negative
function of the correlation between patient and
therapist sorts. The Discrepancy Score was considered
an adequate representation of similarities and
differences of patient and therapist attitudes toward
therapy along the specific dimension of authoritarian-equalitarian
attitudes and behaviors.
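
The two AET scores can be illustrated with a short sketch. The sorts below are invented for illustration, the function names are hypothetical, and treating the first 20 items as the authoritarian ones is an assumption made only to keep the example short. Because every sort places 5 items in each of 8 piles, the pile numbers have the same mean and variance for every subject, so the sum of squared differences D and the Pearson r between two sorts are tied by D = 2N(variance)(1 - r), which is 420(1 - r) for this sort.

```python
import math

def aet_score(sort, authoritarian_items):
    """Sum of pile numbers over the authoritarian items."""
    return sum(sort[i] for i in authoritarian_items)

def discrepancy(sort_a, sort_b):
    """Sum of squared pile-number differences over all items."""
    return sum((a - b) ** 2 for a, b in zip(sort_a, sort_b))

def pearson_r(x, y):
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    ss_x = sum((a - mx) ** 2 for a in x)
    ss_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)

# Hypothetical sorts: 40 items, 8 piles of 5 items each.
patient_sort = [pile for pile in range(1, 9) for _ in range(5)]
therapist_sort = patient_sort[5:] + patient_sort[:5]
authoritarian_items = range(20)  # assumed positions of the 20 authoritarian items

d = discrepancy(patient_sort, therapist_sort)
r = pearson_r(patient_sort, therapist_sort)
print(aet_score(patient_sort, authoritarian_items), d, round(r, 3))
print(round(420 * (1 - r), 6))  # matches d for forced sorts of this shape
```

The same fixed distribution accounts for the stated range of the AET score: the 20 authoritarian items can at lowest fill piles 1 to 4 (a sum of 50) and at highest fill piles 5 to 8 (a sum of 130).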
A Therapist Rating Scale was developed, drawing
heavily from an instrument developed at the University
of Chicago Counseling Center (Rogers &
Dymond, 1954, p. 101) and currently in use there.
Several items of the original form were omitted to
produce a shorter rating blank. A new item was introduced
in which the therapist was asked to rate
the “quality of the relationship,” thus: “Does this
seem to be a ‘good’ and effective therapeutic relationship?
How do you estimate the quality of the
therapeutic relationship between yourself and this patient?”
(nine-point scale from “poor” to “good”).
The rater’s estimate of patient satisfaction in the relationship
was retained in its original form thus:
“Estimate the patient’s feeling about the relationship”
(nine-point scale from “strongly dissatisfied”
to “extremely satisfied”). Only these two items are
utilized in the present study.

The formation of successful or better therapeutic
relationships as a criterion was assumed to be directly
related to the various types of criteria employed
in other studies, but it was thought to have
a specific pertinence of its own, as elemental or more
basic. It seemed reasonable to attempt a direct measure
of the quality of the relationship. It was assumed
that the quality of the relationship is largely
determined and may be evaluated in the very early
contacts of patient and therapist. Although a rating
of patient satisfaction may not be one of the essential
goals of psychotherapy and may not be directly
related to the quality of the relationship, it was
thought to be a useful supplementary criterion measure.
No matter how good the quality of the relationship
may appear to the therapist or judges, the degree
of patient satisfaction with its implications for
continuance in therapy or premature termination
may be a crucial evaluation.

An Observer Rating Scale was developed for the
use of judges in rating patient and therapist behaviors
as observed on short recorded segments of therapy.
Items 1 and 2 provided estimates of the quality
of the relationship and patient satisfaction, and
were identical in form to the items described above.
In Item 3 the therapist’s behavior in the recorded
segment was rated on five dimensions: aggressive-submissive,
directive-nondirective, highly anxious-low
anxiety, dominating-equalitarian, and rigid-flexible.
In Item 4 the patient’s behavior was rated on these
five dimensions: aggressive-submissive, dependent-self-sufficient,
highly anxious-low anxiety, conventional-individualistic,
and rigid-flexible. From the
many qualities and behaviors attributed to authoritarians
in the literature, these five in each case were
selected as being both relevant and ratable. In Item
5 the judge was asked to rate the behavior of the
therapist along the single dimension of authoritarian-equalitarian
on a nine-point scale. In Item 6 a distinction
was made between dominant and submissive
types of authoritarian behavior by the patient. Dominant
behavior was defined by aggressive active authoritarian
behavior, while submissive behavior was
defined by passivity or deference, expecting or seeking
authoritarian behavior by the other. Although dominant
and submissive authoritarian behaviors were
thought to be dynamically related, it seemed plausible
to consider the two aspects mutually exclusive
in any short sample of behavior. Thus, a V-shaped
scale was used, with equalitarian at the apex and
authoritarian-dominant and authoritarian-submissive
at each of the two extensions, each on a nine-point
scale. The judge was asked to select the aspect most
prominent in the given segment and make a rating
on the selected scale.



Samples 


The subjects were drawn from two clinic popula- 
tions. Those designated as Group A include treatment 
cases in the Psychiatry Clinic of Albert Merritt 
Billings Hospital, University of Chicago. Senior medi- 
cal students are required, as part of their training in 
psychiatry, to treat in psychotherapy one selected 
patient who has been referred to the clinic. It should 
be noted that these students had little training and 
no prior experience in psychotherapy. All patients 
were told that their treatment would be limited to 
18 weeks’ duration, after which they would be either 
terminated or referred elsewhere. The present sample 
is composed of patients and therapists drawn from 
this program during two successive quarters. Of the 
35 patients originally tested for this study, 1 was
eliminated because of a suggested organic involvement,
1 for an alleged inability to read, and 1 for
failing to keep the first and subsequent
therapy appointments. The remaining sample of 32
cases included 15 males and 17 females, with a mean
age of 38 years, ranging from 23 to 68 years.

The subjects designated as Group B were drawn 
from the client population of the University of 
Chicago Counseling Center. Clients are normally as- 
signed to therapists on the basis of therapist avail- 
ability, and clients who agree to participate in re- 
search studies are then randomly assigned to proj- 
ects in progress at that time. The present sample 
includes 30 cases assigned to the writer’s project over 
a 6-month period. The therapists in this group in- 
cluded three staff members with extensive experi- 
ence, seven staff members with some or considerable 
experience, and seven students in training who were 
seeing their first or second cases. The client popula- 
tion included 16 males and 14 females, with a mean 
age of 27 years, ranging from 19 to 43 years. 

The population in Group A includes 32 patients 
and 32 therapists, each patient seeing a different 
therapist. In Group B, the population includes 30 
clients and 17 therapists, several therapists treating
more than one client in this sample.


Collection of Data 


Patients and therapists were seen prior to their 
first therapeutic interview and were asked to com- 
plete the F Scale and AET sort. After the second 
therapeutic interview, the therapist completed the 
Therapist Rating Scale. 

Observer ratings were made on Group A only. Recordings
of the first interview were retained, and
5-minute segments were selected from the beginning
and ending of each interview. These segments were
rerecorded in random order, with at least five other
segments between the two segments of any given interview.
Two judges (the writer and another graduate
student of psychology, both with training and
experience in psychotherapy) rated each segment on
the Observer Rating Scale. Thus, for each case there
were four ratings: beginning and ending segments by
each of two judges. One recording was inaudible and
tests based on judges’ ratings will be drawn from an
N of 31. Reliability of judges’ ratings was tested on
each of the 14 scales of the rating form. The two
judges’ summed ratings (beginning plus ending segments)
were significantly correlated on 10 of the 14
scales (9 at the .01 level, 1 at the .05 level: r = .38). Only
these 10 items were utilized in this study. It is striking
that the four items on which the judges were not
in agreement all dealt with patient traits.


RESULTS 


Authoritarianism, as a personality trait of 
the therapist, was hypothesized to be signifi- 
cantly related to his description of the ideal 
therapeutic relationship in terms of directive, 
paternalistic, and nurturant qualities. Full 
scale scores on the F Scale were compared to 
AET scores. For the 32 therapists in Group A 
the Pearson r correlation was .03, a clearly
nonsignificant result. For Group B, with 17
therapists, the Pearson r correlation was .62,
significant at the .01 level.
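
As a rough check on these two results, a correlation can be converted to a t statistic with N - 2 degrees of freedom. The sketch below uses the standard conversion; whether this particular test was the one applied in the study is an assumption, and the function name is hypothetical.

```python
import math

def t_from_r(r, n):
    """t statistic (df = n - 2) for testing a Pearson r against zero."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)

for label, r, n in [
    ("Group A therapists", 0.03, 32),
    ("Group B therapists", 0.62, 17),
]:
    print(label, "df =", n - 2, "t =", round(t_from_r(r, n), 2))
```

The resulting values can be compared with tabled two-tailed critical values of t, about 2.04 at the .05 level for 30 df and about 2.95 at the .01 level for 15 df.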

It was predicted that therapists characterized
by authoritarianism would tend to show
more authoritarian behavior in their therapy
than those characterized as equalitarian. The
Observer Rating Scales were utilized here. The
31 therapists were dichotomized on the basis
of their F Scale scores, 16 low and 15 high.
Results in tests of this hypothesis may be
summarized as follows: (a) On a global behavioral
rating of authoritarian-equalitarian
the high F scorers were rated significantly
more authoritarian than low F scorers. (b)
Although high and low F scorers did not 
differ on the full scale dimension of aggres- 
sive-submissive, they did differ on their de- 
viation from “appropriate” mid-point behav- 
ior, i.e., high scorers were given more extreme 
ratings on this dimension. (c) High scorers 
were rated as more directive, anxious, and 
dominating than low scorers, but not signifi- 
cantly so. (d) Behavior of high scorers was 
rated as significantly more rigid than that of 
low scorers. 

Authoritarianism, as a personality trait of 
the patient, was hypothesized to be signifi- 
cantly related to his description of the ideal 
or preferred therapeutic relationship in terms 
of directive, paternalistic, and nurturant quali- 
ties. Full scores on the F Scale were compared 
to AET scores. The 32 patients in Group A 
showed a Pearson r correlation of .34, signifi- 
cant at the .05 level. In Group B, with 30 pa- 
tients, the Pearson r correlation was .38, sig- 
nificant at the .05 level. 

It was predicted that patients characterized 
by authoritarianism would tend to show more 
authoritarian behavior in their therapy than 
those characterized as equalitarian. The Ob- 
server Rating Scales were utilized here: a 
global rating of patient behavior on a nine-point
scale and a rating on patient aggression.
As a test of this hypothesis the 31 cases
were dichotomized on the basis of the pa- 
tient’s F Scale score, 15 low and 16 high. On 
the global rating of patient behavior the dif- 
ference between low and high scorers was not 
significant. The two groups did not differ on
the full scale dimension of aggressive-submissive.
High scoring patients did show the larger
deviation from “appropriate” mid-point behavior
as predicted, but the difference between
groups was not significant.

In line with the argument of Sanford, discussed
above, it was predicted that patients
who are characterized as equalitarian will tend 
to form better therapeutic relationships than 
those characterized as authoritarian. In Group 
A, the hypothesis was tested against four cri- 
terion measures: the therapist’s rating of the 
quality of the relationship, therapist’s esti- 
mate of patient satisfaction, judges’ composite 
rating of the quality of the relationship, and 
the judges’ composite estimate of patient 
satisfaction. The differences between low and 
high scoring patients on the therapist ratings 
were not significant. The differences on judges’ 
ratings were both in the predicted direction. 
Judges rated the quality of the relationship
significantly (t = 2.50, p < .01) higher for
the group of low F scorers, and the estimate
of patient satisfaction was slightly higher for 
this group but not significantly so. In Group 
B the hypothesis was tested against two cri- 
terion measures, the therapist’s rating of the 
relationship and his estimate of patient satis- 
faction. Differences between low and high 
scorers were not significant. 

The last three hypotheses were developed 
from the argument that similarity of patient 
and therapist personalities facilitates the de- 
velopment of good therapeutic relationships. 
It was hypothesized that patients character- 
ized by authoritarian traits would tend to 
form better therapeutic relationships with 
therapists characterized as authoritarian than 
with those characterized as equalitarian. For 
a test of this and the following hypothesis the 
dichotomies between high and low scorers in 
patient and therapist groups were retained. 
First, each of the patients characterized as 
authoritarian was considered with his respec- 
tive therapist. Mean criterion ratings are 
shown in Table 1. In Group A the hypothesis
was tested against the four criterion measures
listed above. All differences were nonsignificant.
In Group B the hypothesis was tested
against the two criterion measures listed 
above. Both differences were nonsignificant. 

Secondly, it was hypothesized that patients 
characterized by equalitarian traits would 
tend to form better therapeutic relationships 





John L. Vogel 


TABLE 1 


MEAN CRITERION RATINGS ON AUTHORITARIAN 
THEIR RESPECTIVE 


Group A 


Therapist 
Rating 


QR* 


Authoritarian Patients 
Authoritarian Therapist 
Equalitarian Therapist 

Equalitarian Patients 


5.89 
6.00 


Equalitarian Therapist 
Authoritarian Therapist 


* QR = Quality of Relationship, PS = Patient Satisfaction.
* Difference significant at .05 level, in a direction 


with therapists characterized as equalitarian 
than with those characterized as authoritarian. 
Each of the patients characterized as equali- 
tarian was considered with his respective 
therapist. Mean criterion ratings are shown 
in Table 1. For Group A, on the four cri- 
terion measures, all differences were nonsig- 
nificant. In Group B both therapist ratings 
were in a direction opposite to that predicted, 
with the difference on rated patient satisfac- 
tion significant at the .05 level. 

In the last hypothesis, therapist and pa- 
tient descriptions of ideal or preferred ther- 
apy conditions (AET) were utilized. Discrep- 
ancy Scores for each case were computed as 


TABLE 2 


MEAN CRITERION RATINGS ON CASES WITH HIGH AND
LOW DISCREPANCY BETWEEN THERAPIST AND
PATIENT AET SORTS


»B 


Observer Th pis 
Rating Rating 


erapist 


Patient Group OR PS OR PS 
6.06 
6.31 


6.00 
6.19 


4.16 12 5.00 
4.55 58 6.33" 


High Discrepancy 


5.53 
Low Discrepancy 6.13 


* QR = Quality of Relationship, PS = Patient Satisfaction.
* Difference significant at .05 level 


AND EQUALITARIAN GROUI 
\UTHORITARIAN 


AND EQUALITARIAN THEI 


Criterion 
Group B 


Observe Therapist 


Rating Rating 


ypposite to tl 


previously described. These scores were di- 
chotomized in terms of low and high discrep- 
ancy. It was predicted that the quality of the 
therapeutic relationship would be related to 
the degree of discrepancy between patient and 
therapist expectations of authoritarian atti- 
tudes and behavior in therapy. Mean criterion 
ratings of low and high discrepancy groups 
are shown in Table 2. Although only one of 
the differences was statistically significant, 
low discrepancy cases received higher ratings 
on all criterion measures in both groups. 


DISCUSSION 


The failure to find a relationship between
F Scale scores and attitudes toward therapy
in the therapist population of Group A may
be related to the nature of the F Scale items
and the students’ reaction to them. It has
been said that


authoritarian people as measured by the scale
agree more with authoritative statements;
and that, therefore, a portion of the discriminatory
power of the F scale derives from its form
rather than its content (Leavitt, Hax, & Roche, 1955, p


The very authoritative tone of the statements
in the F Scale, referred to as a form characteristic,
may, however, operate with reactive
effect on some subjects. Several of the therapists
(who, it will be recalled, were senior
medical students) commented on the stringent 
wording of the statements. One student com- 
mented that: “In medical school one of the 
first things you learn is to suspect any state- 
ment with ‘always’ or ‘never’ in it.” These 
are not individuals who are rigidly or self- 
consciously equalitarian, but rather students 
trained to be critically sensitive to the literal 
meaning of words, and to hold in suspicion 
all authoritative sounding statements. The 
form component may, in such cases, have an 
inhibitory, and thus invalidating, effect. 

Since therapists’ scores on the F Scale do 
correlate quite well with their rated behavior 
in therapy it may be more reasonable to view 
their F Scale scores as a relatively reliable 
representation of authoritarianism as a per- 
sonality trait and to re-evaluate their expres- 
sion of attitudes toward therapy. It is well to 
remember that this population of therapists 
is composed of students with no experience 
and very little training in psychotherapy. 
They probably had few consciously developed 
attitudes toward therapy. By contrast, the 
therapists in Group B, with more training 
and experience in therapy, do show a con- 
sistency between personality trait and ther- 
apy attitudes. It may be proposed that one 
of the consequences of training and experience 
is the increased congruence of therapist traits 
and attitudes, a greater consistency between 
the personality of the therapist and his con- 
sciously held and expressed attitudes toward 
therapy. Whether such congruency is an ef- 
fect of training or experience, or both, could 
and should be tested. 

It was noted that the judges rated the qual- 
ity of the relationship significantly higher for 
the patient group of low F scorers, while dif- 
ferences between therapist ratings were not 
significant. It may be that this reflects some 
differences in conception of the “good pa- 
tient” role, and differences in what constitutes 
a “good and effective therapeutic relation- 
ship.” Some differences in perspective between 
therapist and judges may also be operative 
here. 

The finding that the rated quality of the
relationship is related to the degree of simi-
larity of patient and therapist descriptions of
a preferred relationship on items specifically
defining authoritarianism tends to support
the second basic hypothesis of this study. 
The quality of the relationship and an esti- 
mate of patient satisfaction in the early inter- 
views appear to be somewhat predictable. To 
say this in another way: there does seem to 
be some pretherapy data from which we could 
anticipate good or poor, satisfying or unsatis- 
fying, therapeutic relationships. 

An observation may be made on the failure 
to find a relationship between the criterion 
and similarity on F Scale scores. Dichotomiz- 
ing cases at the mean F Scale score for the 
group is probably too gross a division. For 
individuals not scoring in the extreme, high 
or low, authoritarianism is probably not the 
most crucial trait. The writer would speculate 
that for these individuals there are other 
traits, attitudes, and needs which play a more 
crucial role in determining the quality of 
their interpersonal relationships. 

It may also be observed that attitude items, 
the AET sort, have a greater immediacy or 
relevance to the therapy situation than F 
Scale items. Many AET items refer to atti- 
tudes or behaviors which are very soon con- 
spicuous by their presence or absence. By 
contrast, the F Scale measures a more funda- 
mental trait which may not express itself so 
immediately or directly. In spite of the care- 
ful manner in which the AET items were de- 
veloped, it may be that the sort contains sev- 
eral items of serious import to the develop- 
ment of the relationship, but not heavily 
loaded with authoritarianism. The method of 
deriving the Discrepancy Score, by summing 
the squares of the pile number differences over 
all items, gives an equal impact to all items. 
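
The scoring rule just described can be illustrated with a minimal sketch. It assumes each sort is recorded as a list of pile numbers, one per item, in the same item order for therapist and patient. The function name `discrepancy_score` and the pile placements are hypothetical and are not taken from the study's data.

```python
def discrepancy_score(therapist_piles, patient_piles):
    """Sum of squared pile-number differences over all items.

    Each argument is a list of pile numbers, one per item, with items
    in the same order in both lists (an assumption of this sketch).
    """
    if len(therapist_piles) != len(patient_piles):
        raise ValueError("both sorts must cover the same items")
    return sum((t - p) ** 2 for t, p in zip(therapist_piles, patient_piles))


# Hypothetical placements for a five-item sort
therapist = [1, 3, 5, 2, 4]
patient = [2, 3, 1, 2, 5]

# Squared differences per item: 1, 0, 16, 0, 1
score = discrepancy_score(therapist, patient)
```

Every item enters the sum with the same weight, so an item only loosely related to authoritarianism contributes as much per unit of disagreement as a central one.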

This discussion should not, however, ob- 
scure the finding that similar attitudes of 
therapist and patient toward therapy were re- 
lated to better therapy relationships. We are 
still some way from the point at which we 
can “match” patient and therapist to maxi- 
mize success in therapy. As a therapist, the 
writer doubts that research of this kind will 
ever take all of the “mystery” and the essen- 
tially personal quality out of psychotherapy. 
Research may, however, help us to avoid the 
more blatant difficulties, and thus permit the 
more individual aspects of psychotherapy to 
operate more effectively. 





John L. Vogel 


SUMMARY 


It was predicted that authoritarianism, as 
a personality trait of therapist and patient, 
would be reflected in their attitudes toward 
therapy and in their therapeutic behavior. 
Secondly, it was hypothesized that authori- 
tarianism and equalitarianism, as interacting 
personality traits of therapist and patient, 
would have specified effects upon the quality 
of the relationship established. 

A total of 62 patients and 49 therapists in 
two clinic populations completed the Cali- 
fornia F Scale and a specially devised in- 
strument in which they described the ideal or 
preferred therapeutic relationship. After the 
second interview these therapists completed a 
scale containing two criterion items: a rating 
of the quality of the relationship and an esti- 
mate of patient satisfaction with the relation- 
ship. In one of the two clinic settings, two 5- 
minute segments were selected from each of 
the first interview recordings. For each seg- 
ment, two judges rated the two criterion items 
and specific and general traits referring to au- 
thoritarian behavior on the part of the thera- 
pist and patient. 

Authoritarianism (as measured by the F 
Scale) was found to be related to authori- 
tarian attitudes toward therapy in both pa- 
tient populations and in one of the two thera- 
pist populations. The hypothesis that au- 
thoritarianism, as measured by the F Scale, 
would be related to authoritarian behavior in 
therapy was supported for the therapist popu- 
lation, but not for the patients. A test of the 
hypothesis that equalitarian patients would 
form better therapeutic relationships than 
authoritarian patients gave equivocal results. 
The second basic hypothesis, that similarity 
of therapist and patient on the specific di- 
mension of authoritarian-equalitarian would 
tend to facilitate the relationship, was not 
supported. There was, however, an associa- 
tion between criterion ratings and the amount 
of discrepancy between therapist and patient 
descriptions of the ideal or preferred relation- 
ship on items related to authoritarianism. 


REFERENCES 


Adorno, T. W., Frenkel-Brunswik, Else, Levinson,
D. J., & Sanford, R. N. The authoritarian
personality. New York: Harper, 1950.

Ashby, J. D., Ford, D. H., Guerney, B. G., Jr., &
Guerney, Louise F. Effects on clients of a reflective
and a leading type of psychotherapy. Psychol.
Monogr., 1957, 71(24, Whole No. 453).

Axelrod, J. An evaluation of the effect on progress
in psychotherapy of similarities and differences
between the personality of patients and their
therapists. Unpublished doctoral dissertation, New
York University, 1951.

Barron, F. X. Psychotherapy as a special case of
personal interaction: Prediction of its outcome.
Unpublished doctoral dissertation, University of
California, 1950.

Bown, O. H. An investigation of the therapeutic
relationship in client-centered psychotherapy.
Unpublished doctoral dissertation, University of
Chicago, 1954.

Christie, R., & Jahoda, M. (Eds.) Studies in the
scope and method of the authoritarian personality.
Glencoe, Ill.: Free Press, 1954.

Freeman, M., & Sweet, B. A theoretical formulation
of some features of group psychotherapy and its
implications for selection of patients. Int. J. group
Psychother., 1954, 4, 355-368.

Goldberg, S., & Stern, G. The authoritarian
personality and education. Amer. Psychologist,
1952, 7, 375. (Abstract)

Haythorn, W., Haefner, D., Langham, P., Couch,
A., & Carter, L. The effects of varying combinations
of authoritarian and equalitarian leaders and
followers. J. abnorm. soc. Psychol., 1956, 53,
210-219.

Hiler, E. W. An analysis of patient-therapist
compatibility. J. consult. Psychol., 1958, 22, 341-347.

Jones, E. E. Authoritarianism as a determinant of
first-impression formation. J. Pers., 1954, 23,
107-127.

Leavitt, H. J., Hax, H., & Roche, J. H.
“Authoritarianism” and agreement with things
authoritative. J. Psychol., 1955, 40, 215-221.

Libo, L. M. The projective expression of patient-
therapist attraction. J. clin. Psychol., 1957, 13,
33-36.

Rogers, C. R., & Dymond, Rosalind F. (Eds.)
Psychotherapy and personality change. Chicago:
Univer. Chicago Press, 1954.

Sanford, F. H. Authoritarianism and leadership.
Philadelphia: Institute for Research in Human
Relations, 1950.

Sanford, N. The approach of the authoritarian
personality. In J. L. McCary (Ed.), Psychology of
personality: Six modern approaches. New York:
Logos, 1956. Pp. 253-319.

Titus, H. E., & Hollander, E. P. The California F
Scale in psychological research: 1950-1955.
Psychol. Bull., 1957, 54, 47-64.


(Received February 5, 1960) 





Journal of Consulting Psychology 
1961, Vol. 25, No. 2, 109-115


SOCIAL DESIRABILITY AND RESPONSE TO PERCEIVED 
SITUATIONAL DEMANDS 


DAVID MARLOWE

College of Medicine, University of Kentucky

DOUGLAS P. CROWNE

Ohio State University


Current research on social desirability
(Cowen & Tongas, 1959; Edwards, 1957;
Wiggins & Rumrill, 1959) has been chiefly
concerned with a descriptive analysis of the
influence of this variable on personality test
responses. Along these lines, social desirability
has achieved major status as a psychometric
variable, the properties typically ascribed to
it being those of a stylistic response determi-
nant (Jackson & Messick, 1958). Pre-emi-
nently, social desirability is considered to be
a characteristic of test items (Edwards, 1957),
and two models have been applied to its as-
sessment. In the first of these procedures,
items on a test are rated for social desirabil-
ity by judges, and then responded to by sub-
jects (Ss) under standard instructions (Ed-
wards, 1953; Rosen, 1956); the correlation
of the two sets of responses is inferred to in-
dicate the amount of test response variance
accounted for by social desirability. The sec-
ond model involves the development of ra-
tional or empirical social desirability scales
(Edwards, 1957; Hanley, 1957), the items of
which show marked social desirability proper-
ties. Correlations between these scales and
various personality tests, such as the MMPI,
are assumed to reflect social desirability bias
in the test responses. This method, however,
can also be employed to identify dissimula-
tors, i.e., those Ss whose personality test re-
sponses conform to the cultural stereotypes
represented by the social desirability scale
(Wiggins, 1959).

The prevalent conceptions of social desir-
ability thus reflect an exclusive concern with
response distortion in psychometric situations,
with an attendant narrowing of research in-
terests to investigations of the social desir-
ability scalability of test items. The concept
of social desirability has not been systemati-
cally investigated in terms of the motivation
of Ss to dissimulate on personality tests and 
the relevance of this motivation to behavior 
in other, nontest situations.¹ This latter con-
ception suggests research in which the differ- 
ential influence of the need to respond in a 
socially desirable fashion would be investi- 
gated in situations where “self” or “item” 
evaluation is not the primary dependent vari- 
able. The present experiment was undertaken 
with this view in mind. 

In a recent report, the writers (Crowne & 
Marlowe, 1960) described the development 
and preliminary validation of a new social 
desirability scale (M-C SDS) and outlined 
the construct of which the scale is at present 
the sole operational definition. In the initial 
study, however, only the essentials of a mo- 
tivational concept of social desirability were 
suggested, and it is desirable here to present 
in further detail some of the implications of 
the construct. 

Social desirability, as presently defined, re- 
fers to a need for social approval and ac- 
ceptance and the belief that this can be at- 
tained by means of culturally acceptable and 
appropriate behaviors. In a _ psychometric 
situation, a high need for social approval 
would be inferred from a person’s attribution 

¹ Shortly after the completion of this experiment,
Allison and Hunt (1959) reported a study investi-
gating the relationship between Edwards SDS and
aggressive responses to varying conditions of frustra-
tion as measured by a paper-and-pencil test. They
interpreted their results as indicating that “the [ag-
gression] ‘suppressing’ effect of the SD factor occurs
primarily in situations in which the culturally ac-
ceptable response is not evident” (p. 532). While
Allison and Hunt are careful to refer to social de-
sirability as a “factor,” their recasting of Edwards’
concept as a “process” perhaps related to “other-
directedness” implies a motivational usage similar to
that of the present research.

of culturally approved statements to himself 
and the denial of culturally unacceptable 
traits. Most importantly, however, to assess 
the strength of social desirability motivation 
in a test situation one must be able to deter- 
mine the actual presence or absence of the 
traits, characteristics, or symptoms that are 
denied by the individual. Clearly, a need for 
approval would not necessarily be implied by 
the failure to attribute socially disapproved 
characteristics to oneself when these charac- 
teristics are not actually descriptive of the 
individual. In the development of the M-C 
scale, a psychometric model was employed 
which avoids the ambiguities arising from the 
failure to consider the actual incidence of 
traits represented in the test items. Items 
were selected for the M-C SDS from a de- 
fined universe representing behaviors which 
are culturally sanctioned and approved but 
which are improbable of occurrence. 

A low need for social approval implies a 
degree of independence of cultural definitions 
of acceptable behavior. The person less mo- 
tivated by a need for social approval might, 
in a testing situation, acknowledge certain 
symptoms, reject them as personally irrele- 
vant, or present other test responses depend- 
ing on such factors as the strength of his 
present needs, the kinds of responses re- 
quired, and the nature of the test stimuli. 

The present need construct clearly implies 
that “social desirability” has considerable 
generality beyond self-evaluative or test situa- 
tions, and this study was undertaken to as- 
sess the construct’s utility for predicting in- 
dividual differences in response to perceived 
cultural definitions of appropriate behavior. A 
situation was required that would be per- 
ceived by Ss as demanding of certain socially 
acceptable behavior. If Ss were presented with 
a boring, repetitive task and required to per- 
form it for a considerable period of time, it 
seems probable that frustration would ensue 
and that negative attitudes would be expressed 
towards the task. Were this boring task to be 
presented by an experimenter (E) who con- 
spicuously played the role of university pro- 
fessor, authority figure, and omniscient psy- 
chologist in the presentation of the experi- 
ment and the elicitation of attitudes towards 
it, Ss with high social approval needs might 




be expected to express more favorable (so- 
cially appropriate) attitudes than Ss less mo- 
tivated for approval. The spool packing task 
used by Festinger and Carlsmith (1959) 
seemed ideally suited for this purpose and, in 
slightly modified form, the attitude question- 
naire employed by them was deemed equally 
adequate. 

The definition of social desirability as a 
need for social approval and the belief that 
this can be attained by means of culturally 
acceptable behaviors would appear to overlap 
in some degree with the variable of conform- 
ity, and from the present definition of social 
desirability a relation with conformity would 
be predicted. The two concepts can be dif- 
ferentiated, however, in that the need for so- 
cial approval is a motivational variable, while 
conformity refers to a class of behaviors. Pre- 
diction of a relationship between social desir- 
ability and conformity assumes that conform- 
ity constitutes a category of behaviors avail- 
able to individuals seeking to gratify social 
approval needs. As regards this experiment, 
there is, nevertheless, a crucial question:
would the two concepts differ in their utility 
for predicting the same behaviors? As a test 
of this the Independence of Judgment Scale 
(Barron, 1953), a paper-and-pencil measure 
of conformity, was included in the experiment 
to assess its value for predicting attitudes to- 
wards the spool packing task. 

Finally, since the present construct and its 
derived test differ from other definitions and 
measures of social desirability, the same re- 
sults would not be expected from other so- 
cial desirability scales. Accordingly, Edwards’
(1957) SDS was incorporated in the pres-
ent design to determine its ability to predict 
the favorability of attitudes towards spool 
packing. 




METHOD 
Subjects 


Fifty-seven undergraduate male students in intro- 
ductory psychology classes participated on a volun- 
tary basis in the experiment. The experiment was 
conducted at the University of Kentucky (26 Ss) 
and at Ohio State University where 31 Ss were ob- 
tained. 


Procedure 


The experimental procedure was identical at the
two universities except for the use of a different E
at each institution, and the administration of the Ed-
wards SDS to 29 Ss at Ohio State only. The Ss were
individually administered the spool packing task, a
four-item questionnaire intended to elicit attitudes
towards the packing task, the M-C SDS, the Bar-
ron Independence of Judgment Scale, and the Ed-
wards SDS. Throughout the entire procedure, E
maintained a professional and somewhat aloof man-
ner, avoiding any conversation with S other than
that necessary to conduct the experiment. The fol-
lowing instructions were read to the S, who was
seated at a table directly opposite E:

My name is Dr. ________. I’m a psy-
chologist and I’m conducting an experiment on
measures of performance. Before we get started
on the experiment, I would like you to fill out
these questionnaires. Sign your name on all of
them.


The Ss then completed the following scales: 

1. The M-C SDS, which consists of 33 items with
true or false response categories.² An illustrative
item is: “I never hesitate to go out of my way to
help someone in trouble.”

2. Immediately following the M-C scale, S com-
pleted the Barron Independence of Judgment Scale,
a 22-item questionnaire previously shown to be valid
for discriminating male conformists from male non-
conformists in an “Asch-type” situation (Barron,
1953; Tuddenham, 1958). An illustrative item is:
“It is easy for me to take orders and do what I am
told.”

3. At Ohio State University, 29 Ss completed the
Edwards SDS after the Barron scale. The 39 items
comprising the Edwards scale were obtained from
the MMPI L, F, and K scales and from the Taylor
Manifest Anxiety Scale.

When S completed the last scale, he was told:


Now for the experiment itself. The materials are 
this box and the 12 spools. I want you to take 
these spools, one at a time, and place them in the 
box. When you are finished, empty the box and 
refill it one spool at a time. Continue to fill and 
empty the box until I tell you to stop. Use one 
hand, and work at your own preferred speed.

S then packed and unpacked the box for 25 minutes
while E held a stopwatch and pad, conspicuously
pretending to be busily engaged in making notes on
S’s performance. After 25 minutes, E said,


O.K., that’s all we have in the experiment itself.
I hope you enjoyed it. You get a chance to see
how you react to the task and so forth. I would
like to know what your personal reactions are to
the task and the experiment. Would you answer
this questionnaire?


The S then rated his reactions to the experiment by
answering the following four questions, taken from
Festinger and Carlsmith (1959):

² A complete description of the M-C scale may be
found in Crowne and Marlowe (1960).



1. Was the task interesting and enjoyable?
Would you rate how you feel about the task on
the scale below where -5 means extremely dull
and boring, +5 means the task was extremely in-
teresting and enjoyable, and 0 means the task was
neutral, neither interesting nor uninteresting.

2. Did the experiment give you an opportunity
to learn about your abilities and skills? Rate how
you feel about this on a scale from 0 to 10 where
0 means you learned nothing and 10 means you
learned a great deal.

3. From what you know about the experiment,
and the task involved in it, would you say the ex-
periment was measuring anything important? That
is, do you think the results may have scientific
value? Rate your opinion on this matter on a
scale from 0 to 10 where 0 means the results have
no scientific value or importance and 10 means
they have a great deal of value and importance.

4. Would you have any desire to participate in
another similar experiment? Rate your desire to
participate in a similar experiment again on a scale
from -5 to +5, where -5 means you would defi-
nitely dislike to participate again, +5 means you
would definitely like to participate again, and 0
means you have no particular feeling about it one
way or the other.

The three scales, the spool packing task, and the 
spool packing questionnaire were presented in two 
orders for the purpose of controlling the possible in- 
fluence of a sequence effect. Half of the Ss packed 
the spools first, answered the four questions, and 
then completed the various scales, while the other 
half completed the three scales first and were then 
administered the task and the four questions. The 
instructions to S were modified in accord with the 
order of presentation used. Ss in the two conditions 
did not differ significantly with respect to means or 
variances on any of the measures, and the data were 
therefore analyzed without regard for the order in
which the tasks were presented.


RESULTS 


As an initial step, the Ohio State and Ken- 
tucky Ss were compared with respect to 
means and variances on all the measures. No 
significant differences were obtained and the 
final analysis of the data was therefore based 
on the combined N of 57. It should be noted 
that significant results similar to those to be
reported below were obtained when statistical 
analyses were carried out separately for the 
Kentucky and Ohio State samples. Thus, the 
findings that follow represent, in essence, the 
pooled results of a replicated experiment. 

In the major hypothesis of the study, it
was predicted that individuals with a high
need for social approval would express more
favorable attitudes towards the spool packing
task than Ss whose needs for social approval
are relatively weaker. To test this hypothesis,
Ss’ scores on the M-C SDS were dichotomized
at the mean (14.93) to yield a high SD group
of 30 Ss, and a low SD group of 27 Ss.
Scores of the high group ranged from 15-29,
while those of the low group were from 5-14.
The differences between the mean ratings
given to the four attitude questions by the
two groups were tested for significance by
means of t. The results of this analysis are
contained in Table 1.


TABLE 1

DIFFERENCES BETWEEN HIGH AND LOW M-C
SD GROUPS IN EXPRESSED ATTITUDES

Question    High (N = 30) Mean    Low (N = 27) Mean    Diff.

How enjoyable tasks were (rated from -5 to +5)
How much they learned (rated from 0 to 10)
Scientific importance (rated from 0 to 10)
Participate in similar experiment (rated from -5 to +5)

** p < .01; one-tailed test.
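
The analysis described above (a split at the scale mean followed by a t test on each attitude rating) can be sketched as follows. This is an illustration only: the scores and ratings are invented, the function names are hypothetical, and a pooled-variance Student t is assumed because the report does not say which form of t was computed.

```python
from math import sqrt
from statistics import mean, variance


def split_at_mean(scale_scores, ratings):
    """Return (high, low) lists of ratings, split by whether the
    matching scale score lies above the mean of all scale scores."""
    cutoff = mean(scale_scores)
    high = [r for s, r in zip(scale_scores, ratings) if s > cutoff]
    low = [r for s, r in zip(scale_scores, ratings) if s <= cutoff]
    return high, low


def student_t(a, b):
    """Independent-samples t with pooled variance (assumed form);
    degrees of freedom are len(a) + len(b) - 2."""
    na, nb = len(a), len(b)
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / (na + nb - 2)
    return (mean(a) - mean(b)) / sqrt(pooled * (1 / na + 1 / nb))


# Invented data: eight Ss, scale score and enjoyment rating (-5 to +5)
sds_scores = [5, 9, 12, 14, 16, 18, 22, 28]
enjoyment = [-3, -2, -1, 0, 1, 2, 2, 3]

high, low = split_at_mean(sds_scores, enjoyment)
t_value = student_t(high, low)
```

With a directional prediction, as here, the resulting t would be referred to a one-tailed critical value for the stated degrees of freedom.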

Inspection of Table 1 indicates that the 
two groups differed significantly in mean rat- 
ings on each of the four questions. These dif- 
ferences are all in the predicted direction with 
the high SD group expressing significantly 
more favorable attitudes towards the experi- 
mental task than the low group. These find- 
ings support the general hypothesis that indi- 
viduals with a strong need for social ap- 
proval are significantly more likely to express 
attitudes congruent with perceived situational 
demands than individuals with a lesser need 
for social approval. 

To assess the relationship between scores 
on the Barron Independence of Judgment 
Scale and attitudes towards the spool packing 
task, an analysis similar to that carried out 
for M-C SDS was performed. Scores on the 
Barron scale were dichotomized at the mean 
(10.39) to yield a high conformity group
(N = 31), and a low conformity group (N =
26). Scores in the low group ranged from 2
to 10, while scores in the high group ranged 


from 11 to 20. The mean ratings given to the 
four questions by the two groups were then 
compared. 

The findings reported in Table 2 indicate 
only one significant difference between the 
mean ratings given by the two groups. On 
Question 2, the ratings given by the high con- 
formity group as to how much they learned 
about their abilities and skills were signifi- 
cantly higher (t = 2.02, p < .05) than the
ratings assigned by the low conformity group.
This single significant difference, out of a pos-
sible four, indicates that the conformity vari-
able has limited utility for differentiating in-
dividuals in the favorability of expressed atti-
tudes towards the spool packing task.

The Edwards SDS, purported to be a meas-
ure of test-taking defensiveness (i.e., a meas-
ure of a non-test-relevant response determi-
nant), was included in the study as a contrast
to the present motivationally defined con- 
struct. Scores on the Edwards scale were di- 
chotomized at the mean (32.34), and a high 
group containing 14 Ss (range of 33-39) and 
a low group containing 15 Ss (range of 24- 
31) were obtained. 

The significance of the differences between 
the mean ratings given by the high and low 
Edwards SD groups to the four questions 
was also measured by t tests. Table 3 pre-
sents these data, and indicates that no sig-
nificant differences were obtained, with the
four t’s clustering around a value of 0. Quite
clearly, social desirability as measured by the 
Edwards scale is unrelated to attitudes to- 
wards the experimental task. 


TABLE 2

DIFFERENCES BETWEEN HIGH AND LOW CON-
FORMITY GROUPS IN EXPRESSED ATTITUDES

Question

How enjoyable tasks were (rated from -5 to +5)
How much they learned (rated from 0 to 10)
Scientific importance (rated from 0 to 10)
Participate in similar experiment (rated from -5 to +5)

* p < .05; one-tailed test.



As a final step in the analysis of the data, 
the intercorrelations between the M-C, Ed- 
wards, and Barron scales were computed to 
determine the extent to which scores on these 
scales are related to each other. Table 4 con- 
tains the results of this analysis. 

Inspection of Table 4 reveals that M-C SD 
scores are significantly correlated with both 
Edwards SD scores (r = .56, N = 29) and 
with conformity scores (r = — .54, N = 57). 
Scores on the Edwards scale are uncorrelated 
with scores on the Barron scale (r = — .12, 
N = 29). We may conclude that individuals 
with a high need for social approval (M-C 
SD) tend to deny the symptoms and com- 
plaints represented in the Edwards SD items, 
and that a high need for social approval is 
also characteristic of individuals who give re- 
sponses on the Barron scale indicative of a 
relative lack of independence of judgment. 
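
As a rough check on the reported correlations, a Pearson r can be converted to a t statistic with N - 2 degrees of freedom. The sketch below assumes this conventional test (the report does not state how significance was evaluated) and uses only the r and N values given in the text; the function name `t_from_r` is hypothetical.

```python
from math import sqrt


def t_from_r(r, n):
    """t statistic for testing a Pearson r against zero, df = n - 2."""
    return r * sqrt(n - 2) / sqrt(1 - r * r)


# r and N as reported in the text
reported = {
    "M-C SD vs. Edwards SD": (0.56, 29),
    "M-C SD vs. conformity": (-0.54, 57),
    "Edwards SD vs. conformity": (-0.12, 29),
}

t_values = {label: t_from_r(r, n) for label, (r, n) in reported.items()}

for label, t in t_values.items():
    print(f"{label}: t = {t:.2f}")
```

The first two values come out near 3.5 and -4.8, well beyond conventional .01 critical values for their degrees of freedom, while the third is near -0.6, which agrees with the description of the Edwards and Barron scales as uncorrelated.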


DISCUSSION 


The major purpose of this study was to 
assess the utility of treating the construct of 
social desirability as a motivational variable 
applicable over a range of situations, in con- 
trast to the usual approach of employing 
measures of social desirability solely to ac- 
count for non-test-relevant response variance 
on personality questionnaires. That social de- 
sirability scales designed to measure a spe- 
cific test-taking attitude can account for a 
portion of the variance in responses to per- 
sonality tests has been amply demonstrated 
(Edwards, 1957; Fordyce, 1956; Wiggins, 
1959). There has been a general failure, how-
ever, to consider the possibility that the dis-
position to dissimulate in a test situation may
be an expression of a generalized need to seek
social approval. The findings of this study
provide clear support for a theoretical ra-
tionale which views social desirability in mo-
tivational terms, regarding it as a need for
social approval accompanied by a belief or
expectancy that this need can be satisfied by
engaging in culturally and situationally sanc-
tioned behaviors.


TABLE 3

DIFFERENCES BETWEEN HIGH AND LOW
EDWARDS SD GROUPS IN EXPRESSED
ATTITUDES

Question    Low (N = 15) Mean    Diff.

How enjoyable tasks were (rated from -5 to +5)
How much they learned (rated from 0 to 10)
Scientific importance (rated from 0 to 10)
Participate in similar experiment (rated from -5 to +5)


TABLE 4

CORRELATIONS BETWEEN M-C SD, EDWARDS
SD, AND CONFORMITY SCALES

              Edwards SD         M-C SD

Conformity    -.12 (N = 29)      -.54** (N = 57)
M-C SD         .56** (N = 29)

** p < .01.

As predicted, the attitudes of the high M-C
SD group were significantly and uniformly
more favorable toward the experiment than 
those of the low M-C SD group. We would 
suggest that the Es, as a consequence of their 
prestige and mildly authoritative manner (re- 
flected in their title, occupation, and behav- 
ioral aloofness), were perceived by the high 
M-C SD Ss as persons whose favor was worth 
courting. Consequently, these high M-C SD 
individuals were strongly motivated to yield 
to the demands of the situation: i.e., to tell 
the E that his experiment was interesting,
important, personally informative, and worth 
returning to. In contrast, individuals less 
strongly motivated for social approval were 
better able to resist stating what seemed so- 
cially appropriate and to offer instead more 
realistic appraisals of the experiment. Pre- 
sumably, the less favorable opinions of the 
low M-C SD Ss reflect, in part, the greater 
freedom of this group from social pressures 
in the formulation and expression of their 
opinions. The significant correlation of —.54 
obtained between M-C SD and conformity 
would seem to support this formulation. 
Scores on the Barron scale, however, did not 
serve to discriminate the favorability of ex- 
pressed attitudes towards the boring task as 
well as M-C SD scores. Although one might
find certain similarities at a definitional level,
we would conclude that the need for social 
approval and conformity (as measured by 
the Barron scale) are not by any means 
identical concepts. In terms of the present 
experimental evidence, conformity is perhaps 
best conceptualized as defining a class or 
mode of behaviors in which individuals with 
a strong need for social approval may engage 
in a particular situation. 

The Edwards scale had no utility whatso- 
ever for predicting differences in attitudes to- 
wards the experiment. In a situation where 
self-evaluation is not a relevant factor, the 
Edwards scale appears to be of little value in 
the understanding of motivational determi- 
nants of behavior. This is hardly surprising 
when one recalls that the items included in 
the Edwards scale refer almost exclusively to 
the presence or absence of symptoms and 
complaints, with a consequent restriction of 
the behaviors that are represented in the item 
content. By way of contrast, items for the 
M-C SDS were selected with the intent that 
they be referents of a construct explicitly de- 
fined in motivational terms. 

Moreover, scores on the Edwards scale 
were uncorrelated with conformity scores (r = 
— .12), a finding which suggests, when added 
to other data recently reported (Wiggins, 
1959; Wiggins & Rumrill, 1959), that the Ed- 
wards scale may not be a “pure” measure of 
test-taking attitudes. To date, very high cor- 
relations have been reported between the Ed- 
wards scale and various MMPI scales and 
between the Edwards scale and the Taylor 
Manifest Anxiety Scale (Crowne & Marlowe, 
1960; Edwards, 1957; Wiggins, 1959). In 
contrast to these findings, considerably smaller 
correlations have been reported between the 
Edwards scale and tests less related to per- 
sonal adjustment (Crowne & Marlowe, 1960). 
It seems reasonable to suggest that the Ed- 
wards scale measures the extent to which an 
individual is willing to admit to symptoms in- 
dicative of maladjustment. Thus, we may ex- 
pect substantial relationships between the Ed- 
wards scale and other measures when there is 
a corresponding overlap in item content (par- 
ticularly that related to psychopathology). 

The present study may be viewed as an attempt 
to delineate elements in the nomological 
net surrounding a defined construct of social 
desirability. It seems quite apparent that 
the “meanings” which may be attached to the 
Edwards scale as a measure of social desir- 
ability are limited in scope and differ in ma- 
jor respects from the demonstrated and im- 
plied meanings of the present conception. The 
findings with respect to the need for social 
approval strongly support the hypothetical 
properties ascribed to it. As Cronbach and 
Meehl (1955) have noted, successful predic- 
tions with diverse criteria support the claim 
of construct validity more forcibly than do 
predictions involving very similar behaviors. 


SUMMARY 


An attempt was made to assess the utility 
of defining the construct of social desirability, 
in motivational terms, as a need for social approval. 
A new social desirability scale previously 
developed to measure this variable was 
administered to subjects at two universities. 
For comparative purposes, the Edwards So- 
cial Desirability Scale and the Barron Inde- 
pendence of Judgment Scale were also in- 
cluded in the study. 

Subjects performed a boring task for 
minutes, and then rated their attitudes to- 
wards the experiment. The major hypothesis 
of the study predicted that individuals with 
a strong need for social approval would ex- 
press significantly more favorable attitudes 
towards the experiment than individuals with 
a relatively weak need for social approval. 
The significant findings reported confirmed 
this prediction. Scores on the Edwards and 
Barron scales were not significantly related 
to the favorability of the subject’s attitudes. 

The overall results were interpreted as con- 
tributing to the delineation of the properties 
which may be attached to two current defini- 
tions of the social desirability variable. 


REFERENCES 


Allison, J., & Hunt, D. E. Social desirability and 
the expression of aggression under varying conditions 
of frustration. J. consult. Psychol., 1959, 23, 528-532. 

Barron, F. Some personality correlates of independence 
of judgment. J. Pers., 1953, 21, 287-297. 

Cowen, E. L., & Tongas, P. N. The social desirability 
of trait descriptive terms: Applications to 
a self-concept inventory. J. consult. Psychol., 1959, 
23, 361-365. 

Cronbach, L. J., & Meehl, P. E. Construct validity 
in psychological tests. Psychol. Bull., 1955, 52, 
281-302. 

Crowne, D. P., & Marlowe, D. A new scale of social 
desirability independent of psychopathology. 
J. consult. Psychol., 1960, 24, 349-354. 

Edwards, A. L. The relationship between judged desirability 
of a trait and the probability that the 
trait will be endorsed. J. appl. Psychol., 1953, 37, 
90-93. 

Edwards, A. L. The social desirability variable in 
personality assessment and research. New York: 
Dryden, 1957. 

Festinger, L., & Carlsmith, J. M. Cognitive consequences 
of forced compliance. J. abnorm. soc. 
Psychol., 1959, 58, 203-210. 

Fordyce, W. E. Social desirability in the MMPI. J. 
consult. Psychol., 1956, 20, 171 

Hanley, C. Deriving a measure of test-taking defensiveness. 
J. consult. Psychol., 1957, 21, 391-397. 

Jackson, D. N., & Messick, S. Content and style in 
personality assessment. Psychol. Bull., 1958, 55, 
243-252. 

Rosen, E. Self-appraisal, personal desirability, and 
perceived social desirability of personality traits. 
J. abnorm. soc. Psychol., 1956, 52, 151-158. 

Tuddenham, R. D. Studies in conformity and yielding: 
VII. Some correlates of yielding to a distorted 
group norm. ONR tech. Rep., 1958, No. 8, 
Contract NR 170-159. 

Wiggins, J. S. Interrelationships among MMPI 
measures of dissimulation under standard and social 
desirability instructions. J. consult. Psychol., 
1959, 23, 419-427. 

Wiggins, J. S., & Rumrill, C. Social desirability in 
the MMPI and Welsh’s factor scales A and R. J. 
consult. Psychol., 1959, 23, 100-106. 

(Received February 11, 196 











Journal of Consulting Psychology 
1961, Vol. 25, No. 2, 116-122 


ATTITUDES TOWARD SEX ROLES AND FEELINGS 
OF ADEQUACY IN HOMOSEXUAL MALES¹ 


BRENDA A. DICKEY 


University of Colorado 


Many writers have seen homosexuality as 
either a concomitant or a cause of maladjust- 
ment and neurosis, although a few theorists 
have felt that homosexuals may be either 
adjusted or maladjusted. Evelyn Hooker’s 
(1957) empirical results suggest that there is 
some justification for thinking that homo- 
sexual males may vary in their degree of ad- 
justment. A major assumption underlying this 
research is that homosexuals, as well as het- 
erosexuals, may, indeed, be differentially ad- 
justed, and this study attempts to investigate 
factors that may be associated with varying 
degrees of adjustment in the homosexual male. 

Factors which may be important are sug- 
gested by Bennett and by Hooker. Bennett 
(1947) emphasizes the fact that the hetero- 
sexual majority in a disapproving society 
tends to increase the homosexual’s sense of 
isolation through its selective treatment of 
him. The heterosexual is able to choose freely 
his social isolation; the homosexual is not. 
The only emotional surcease he may find is 
in the company of others like himself. In an- 
other article, Hooker (1956) agrees with 
Bennett that the homosexual male’s adjust- 
ment may be greatly facilitated by associa- 
tion with others like himself, and by adopt- 
ing the standards of the homosexual group. 
Following this line of thinking, one might be 
led to expect that the person who has many 
homosexual contacts or associations would be 
the most satisfied with his status. The op- 
posite would be expected of the homosexual 
male who is forced to maintain heterosexual 
contacts and associations. This reasoning is 
subsumed here under a “role conflict” concept; 
role conflict can be said to exist when 
the individual is required by the social demands 
of a particular situation to behave in 
a manner incongruent with his normal, self-accepting 
role (assuming, of course, that he 
does accept the role of homosexuality for himself). 
Conflicts of this nature might be expected 
to exist, for example, in an individual 
who is employed in a job where he must display 
the characteristics of a typical heterosexual 
male. 

¹ This paper is based upon a master’s thesis submitted 
to the University of Colorado and was partially 
supported by a grant from the Graduate 
School of that institution. The data for it were collected 
while the author was a United States Public 
Health Fellow under Training Grant M-6613. 

Besides this sort of frustration and thwart- 
ing that the homosexual male may have to 
face in a very real, objective sense, he may 
also face conflicts which arise in his percep- 
tions of his role. In order to continue build- 
ing a set of logically consistent hypotheses, 
this study made the assumption that the per- 
son feels most comfortable and assured if he 
identifies with the role of the typical homo- 
sexual male, whatever he may perceive this 
role to be. A homosexual can therefore face 
a subjective, cognitive role conflict if he per- 
ceives a greater discrepancy between himself 
and the typical homosexual male than be- 
tween himself and the typical heterosexual 
male. This individual is one who feels that 
the qualities and attributes he possesses are 
closer to those characteristic of the typical 
heterosexual male than those characteristic of 
the typical homosexual male. 

The terms “adjustment” and “maladjust- 
ment” can have a variety of meanings. Per- 
haps more satisfactory operations for these 
terms could be provided by other measures, 
but these were precluded by the scope of the 
present study. Instead, subjective measures, 
relating to the person’s feelings about himself 
(his status, his sense of well-being, adequacy, 
etc.), were used. In order to avoid confusion 
in terminology, future reference will be 
made to “feelings of adequacy” instead of to 
“adjustment.” 

In this research, it is expected that feelings 
of adequacy will be greater in those who: 

1. Have homosexual contacts and associa- 
tions (“Contacts and associations” here in- 
clude leisure-time activity, “homosexual mar- 
riage,” and membership in homosexual groups 
and organizations.) 

2. Suffer fewer pressures toward heterosexual 
behavior and attitudes (“Pressures toward 
heterosexual behavior and attitudes” are 
presumed to be found in masculine or conflictful 
types of employment. A further indication 
of pressures is also inferred from an 
individual’s willingness to reveal his homosexual 
status to friends, relatives, and work 
associates, and nonpreferred contact with heterosexuals.) 

3. Identify with the typical homosexual 
male 

4. Perceive fewer desirable characteristics 
in the role of the typical heterosexual male 

5. Perceive a smaller discrepancy between 
themselves and the typical homosexual male 
than between themselves and the typical het- 
erosexual male 

It should be noted here that the conviction 
with which these hypotheses were made was 
tempered by the fact that little research in 
this area has been done, and that Hooker and 
Bennett’s suggestions, upon which only Hy- 
pothesis 1 is based, do not rest on a large 
body of experimental evidence. Failure to 
verify the hypotheses should not, therefore, be 
necessarily construed as a failure or short- 
coming of the admittedly naive ideas and 
theories on which they are based. 


PROCEDURE 


Subjects. A total of 47 subjects (Ss) was used in 
this study. The larger portion was contacted through 
the cooperation of the Denver and the San Francisco 
Chapters of the Mattachine Society, a national organization 
concerned with problems of sexual adjustment.² 
Anonymity of the Ss was preserved by nameless, 
numbered questionnaires. In order to facilitate 
retesting of some Ss at a later date, a contact individual 
in each of these cities maintained a list of Ss 
and their corresponding questionnaire numbers. Due 
to the nature of the sampling problems involved in 
this type of research, the representativeness of the 
sample is probably as good as can be achieved under 
the present circumstances. The contacts were instructed 
to sample as many diverse elements of their 
respective homosexual populations as possible in order 
to further the aim of representativeness. 

² The author is grateful to the members and friends 
of this society for their cooperation, and to William 
A. Scott and Evelyn Hooker for their criticism. 

For the purpose of this study, the criterion of 
homosexuality was defined by the Ss themselves. 
They were deemed “homosexual” if they were willing 
to label themselves as such to the contact person. 
Some individuals volunteered the information that 
they were predominantly bisexual, or preferred to 
think of themselves in this manner. However, as far 
as the investigator was able to ascertain, no S claimed 
to enjoy sexual relations with the opposite sex more 
than he enjoyed sexual relations with the same sex. 
Ss ranged in age from 21 to 63 years, with a mean 
of 34 years. They had been aware of their own 
homosexuality from 2 to 45 years, and the mean 
time they had been aware of their status was 17 years. 
Eleven considered themselves “homosexually married,” 
but only one was heterosexually married. A 
great diversity of occupations was represented, and 
the residences of the Ss also varied. 

The data for this study were collected by means 
of a paper-and-pencil questionnaire. Fifteen Ss were 
group tested in Denver at a special meeting of the 
Mattachine Society, which publicized the session as 
widely as possible. At this time other blank 
questionnaires were handed out to interested individuals 
who thought they could contact friends, acquaint- 
ances, etc. who would be willing to complete the 
test and forward it directly to the writer. Stamped, 
self-addressed envelopes were provided for this pur- 
pose so that the completed questionnaire need not 
pass through the hands of a third party. 

Another 12 questionnaires were individually ad- 
ministered in San Francisco through the aid of the 
contact who is employed as a minister-counselor in 
the Mattachine office of that city. He endeavored to 
give the test to the first 12 homosexuals who came 
to his office. The remaining 20 questionnaires were 
returned from various sources, presumably filled out 
by friends or acquaintances of the original group 
of 15. 

Measurement of Feelings of Adequacy. Feelings of 
adequacy were measured by the following two de- 
vices: 

1. The self-ideal discrepancy. S was asked to rate 
each of 46 traits according to how well it described 
“himself as he is now,” on a seven-point scale with 
1 indicating “exactly like.” This set of ratings de- 
scribed the self-concept, and the same procedure was 
followed in ascertaining the ideal-self-concept, ex- 
cept that S was asked to rate each of the same words 
according to how he “would like to be.” It was as- 
sumed that the larger the sum of the absolute dis- 
crepancies in ratings of these traits under the two 
different sets, the greater the feelings of inadequacy 
in the S. Examples of these traits and the rationale 
for selecting the particular set used are described in 
more detail in the section on “Measurement of Subjective 
Role Conflict.” 

TABLE 1 

COEFFICIENTS OF STABILITY 
(N = 19) 

Measure                               r 
Self-Het. Discrepancy                .62 
Ideal-Het. Discrepancy               .77 
Ideal-Hom. Discrepancy               .75 
Self-Hom. Discrepancy                .80 
Ideal-Self Discrepancy               .86 
Direct Measure of Self-Adequacy      .78 

2. The direct measure. Twenty statements of the 
MMPI type, referring to S’s feelings of adequacy, 
were prepared and tested for homogeneity on a pilot 
sample of 45 General Psychology students. S was 
asked to rate each of the statements on a seven-point 
scale according to how much of the time he thought 
it applied to himself. Examples of these statements 
are: “I am entirely self-confident,” “I certainly feel 
useless,” and “I feel that I am a stable person.” 
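
The self-ideal discrepancy described under device 1 above (the sum of the absolute differences between the two sets of seven-point ratings) can be sketched as follows. The function name and the five example ratings are hypothetical and serve only as illustration; the study itself used 46 traits, and no individual ratings are reported.

```python
from typing import Sequence


def sum_abs_discrepancy(ratings_a: Sequence[int], ratings_b: Sequence[int]) -> int:
    """Sum of absolute rating differences across traits.

    Both sequences hold ratings of the same traits, in the same order,
    on a seven-point scale (1 = "exactly like").
    """
    if len(ratings_a) != len(ratings_b):
        raise ValueError("both rating sets must cover the same traits")
    for rating in list(ratings_a) + list(ratings_b):
        if not 1 <= rating <= 7:
            raise ValueError("ratings must lie on the seven-point scale")
    return sum(abs(a - b) for a, b in zip(ratings_a, ratings_b))


# Hypothetical ratings of five traits (illustration only).
self_as_he_is_now = [2, 5, 3, 7, 1]
would_like_to_be = [1, 2, 3, 4, 1]

# Differences are 1, 3, 0, 3, 0, so the discrepancy is 7.
print(sum_abs_discrepancy(self_as_he_is_now, would_like_to_be))
```

Under the assumption stated in the text, a larger sum indicates greater feelings of inadequacy. The same computation would apply to the other discrepancy measures (for example, self less typical homosexual male) by substituting the corresponding rating sets.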

Measurement of Objective Role Conflict. Objective 
role conflict was said to exist for a homosexual male 
employed in a job where he must display charac- 
teristics of a typical heterosexual male. Thus this 
study assessed Ss’ occupations, and these were rated 
by the investigator and another judge according to 
whether or not they seemed likely to pose conflict 
for the individual. “Conflict” was defined by two 
criteria: requiring the individual to behave in a man- 
ner characteristic of a typical heterosexual male, and 
assumption of this role in a type of employment 
which also required frequent contact with a pre- 
dominantly heterosexual public. A coding of 1 was 
made to indicate absence of conflict, 2 indicated that 
an ambiguous or neutral occupation was held, and 
3 indicated the likelihood of sex role conflict. A self- 
employed bookshop operator, a student, and an artist 
were, for example, rated 1; and a railroad engine- 
man, an engineering geologist, and a lawyer were 
rated 3. 

Measurement of Subjective Role Conflict. These 
measures were developed from lists of the same traits 
as were used to determine the self-ideal discrepancy. 
In other parts of the questionnaire S was asked to 
rate each of the 46 traits according to how well it 
described the “typical male homosexual,” the “typi- 
cal female heterosexual,” and the “typical male het- 
erosexual.” The same seven-point scale was utilized 
as before. The rationale for selecting the specific traits 
used rests upon articles like that of Parsons (1956), 
wherein some suggestions are made concerning dif- 
ferentiating qualities assigned to men and women in 
this culture, and upon intuitive hunches regarding 
traits which may be thought of as characteristic of 
men, women, and male homosexuals. Some of the 




descriptive adjectives were fillers, but the majority 
were selected to represent diverse areas of human be- 
havior, regarded in popular stereotypes as charac- 
teristic of these three groups. Some illustrative ex- 
amples are: “able to get along with everybody,” “ag- 
gressive,” “ambitious,” “creative,” “flighty,” “fault 
finding,” “intellectual,” “irresponsible,” “mature,” 
“self-centered,” “sociable,” and “talented.” 
Reliability of the Measures. Nineteen of the original 
Ss were recontacted and administered a shortened 
form of the questionnaire approximately 3.5 to 4 
months after the original test. Coefficients of stability 
were then calculated for the discrepancies between 
the self-concept and the typical heterosexual 
male, the ideal-self and the typical heterosexual male, 
the ideal-self and the typical homosexual male, the 
self and the typical homosexual male, and the ideal-self 
and the self. A coefficient of stability was also 
calculated for the direct measure of adequacy. These 
product-moment correlations are shown in Table 1. 
Split-half reliability coefficients were calculated on 
the total original sample for the two measures of 
adequacy. These, corrected for length by the Spearman-Brown 
formula, are .96 for the self-ideal discrepancy, 
and .91 for the direct measure. 
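
As an illustration of the correction for length mentioned above, the Spearman-Brown formula can be sketched as below. The function names are hypothetical, and the uncorrected half-test correlations are not reported in the article; the second function merely inverts the formula for the split-half case to show what the corrected values would imply.

```python
def spearman_brown(r_half: float, length_factor: float = 2.0) -> float:
    """Predicted reliability when a test is lengthened by `length_factor`.

    With length_factor = 2 this is the usual split-half correction:
    r_full = 2 * r_half / (1 + r_half).
    """
    return length_factor * r_half / (1.0 + (length_factor - 1.0) * r_half)


def implied_half_correlation(r_full: float) -> float:
    """Invert the split-half (length_factor = 2) correction."""
    return r_full / (2.0 - r_full)


# Corrected coefficients reported in the text.
for label, r_full in [("self-ideal discrepancy", 0.96), ("direct measure", 0.91)]:
    r_half = implied_half_correlation(r_full)
    print(label, round(r_half, 3), round(spearman_brown(r_half), 2))
```

On this reading, the reported values of .96 and .91 would correspond to half-test correlations of roughly .92 and .83, respectively.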
Indications of the Validity of the Measures. An 
indication of the validity of the two measures of self-adequacy 
was arrived at by correlating the scores between 
the direct measure and the self-ideal discrepancy. 
The obtained product-moment r of .53 was 
judged sufficient to allow the use of both as representing 
feelings of adequacy in this study. 
Some support for the validity of certain of the 
other discrepancy measures is also provided by data 
from the original sample of homosexuals. Mean raw 
score discrepancies were calculated for the following: 
the self-typical homosexual male (self-hom.), the 
self-typical heterosexual female (self-fem.), the 
self-typical heterosexual male (self-het.), the typical 
homosexual male-typical heterosexual female 
(hom.-fem.), and the typical homosexual male-typical 
heterosexual male (hom.-het.). These differences between 
mean discrepancies were then tested for significance 
(Table 2). It is readily apparent that the 
average self is seen as more like the typical homosexual 
male than like the typical heterosexual female, 
and more like the typical heterosexual female 
than like the typical heterosexual male. It is also apparent 
that the typical homosexual male is perceived 
as more like the typical heterosexual female than like 
the typical heterosexual male. These results correspond 
with common assumptions concerning the relative 
similarities of homosexuals and typical males 
and typical females; hence, they lend a certain degree 
of confidence to the discrepancy measures on 
which they are based. 

TABLE 2 

MEAN RAW SCORE DISCREPANCIES FOR THE SELF LESS 
THE TYPICAL HOMOSEXUAL MALE, HETEROSEXUAL 
FEMALE, AND HETEROSEXUAL MALE. MEAN 
RAW SCORE DISCREPANCIES FOR THE TYPICAL 
HOMOSEXUAL MALE LESS THE TYPICAL 
HETEROSEXUAL FEMALE AND 
HETEROSEXUAL MALE 

Discrepancy     Mean     SD       t(a) 
Self-Hom.       61.6     21.7 
                                  2.79* 
Self-Fem.       68.3     23.9 
                                 -3.96** 
Self-Het.       75.8     27.8 

Hom.-Fem.       57.6     25.6 
                                 -6.26** 
Hom.-Het.       73.2     28.2 

Note. Total N = 47. One S was excluded from the Hom.-Het. 
analysis because he obtained a discrepancy score of 0. It 
is assumed that he misunderstood the instructions, so that the 
ratings he gave to the typical homosexual male were duplicated 
on the page which asked for ratings of the typical heterosexual 
male. 

(a) t of correlated difference. 
* p < .01, two-tailed test. 
** p < .001, two-tailed test. 


RESULTS 


The major portion of the data analysis was 
carried out in the following manner. The raw 
self-adequacy scores on both measures and 
the various raw discrepancy scores were ordered 
in a frequency distribution and inspected 
to find equal intervals which would 
cover all the distributions of the total number 
of discrepancies to be used in the analysis. 
It was decided to use six intervals, in order 
to facilitate punching the data into a 
single IBM card column, and yet allow a 
sizeable number of Ss in each group. Unless 
otherwise noted, all analyses to be reported 
in this section are based on these group scores 
rather than on the raw scores. It should also 
be noted (a) that the smaller the group scores, 
e.g., 1 as opposed to 6, the smaller the absolute 
discrepancy, and (b) that in the case of 
the self-adequacy measures, the smaller the 
group scores, the greater the feelings of adequacy. 
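
The conversion of raw scores to six group scores can be sketched as follows. The article does not report the interval boundaries actually chosen, so the limits used here (`lowest`, `highest`) and the function name are assumptions made purely for illustration.

```python
def to_group_score(raw: float, lowest: float, highest: float, n_intervals: int = 6) -> int:
    """Map a raw score onto one of `n_intervals` equal-width groups.

    Group 1 holds the smallest raw scores and group `n_intervals`
    the largest; `lowest` and `highest` are assumed limits of the
    raw-score range, not values taken from the study.
    """
    if not lowest <= raw <= highest:
        raise ValueError("raw score lies outside the assumed range")
    if raw == highest:
        # The top of the range belongs to the last interval.
        return n_intervals
    width = (highest - lowest) / n_intervals
    return int((raw - lowest) // width) + 1


# Hypothetical raw discrepancy scores on an assumed 0-60 range.
for raw in (0, 9, 10, 35, 59, 60):
    print(raw, to_group_score(raw, lowest=0, highest=60))
```

A single digit from 1 to 6 per measure is what allows each score to occupy one card column, as the text notes.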


Results Relating to Objective Role Conflict 


Hypothesis 1 stated that the more homo- 
sexual contacts and associations participated 
in by the S, the greater would be his feelings 
of adequacy. To test this, the following op- 
erations were performed. Mean feelings of 
adequacy on both measures were calculated 
for: 

1. Ss associating predominantly with other 
homosexuals and those associating predominantly 
with heterosexuals 

2. Ss belonging to homosexual social groups 
or organizations, and those belonging to 
neither 

3. Ss who considered themselves homosexually 
married and those who did not 

TABLE 3 

MEAN FEELINGS OF ADEQUACY FOR SUBJECTS WHO 
ARE EITHER “HOMOSEXUALLY MARRIED” OR 
“HOMOSEXUALLY UNMARRIED” 

                              N     Mean    SD      t 
Direct Measure 
  Homosexually Married        11    2.82    1.47 
                                                    1.149 
  Homosexually Unmarried      36    3.36    1.02 
Self-Ideal Discrepancy 
  Homosexually Married        11    2.18    1.33 
                                                    2.956* 
  Homosexually Unmarried      35    3.51    1.27 

* p < .01, two-tailed test. 

No significant differences were found in the 
first two operations; however, the differences 
between the mean self-adequacy scores reached 
significance on one measure in the third op- 
eration (Table 3). Apparently homosexual 
males can be said to feel more adequate if 
they are homosexually married. 
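
The comparison in Table 3 can be approximated from the published summary statistics. The article does not state which t formula was used; the sketch below assumes a separate-variance standard error, and the function name is hypothetical.

```python
import math


def t_from_summary(mean_a: float, sd_a: float, n_a: int,
                   mean_b: float, sd_b: float, n_b: int) -> float:
    """t for the difference between two independent group means.

    Uses the separate-variance standard error
    sqrt(sd_a^2 / n_a + sd_b^2 / n_b); this choice is an assumption,
    since the article does not name its formula.
    """
    standard_error = math.sqrt(sd_a ** 2 / n_a + sd_b ** 2 / n_b)
    return (mean_b - mean_a) / standard_error


# Group statistics as printed in Table 3 (married first, unmarried second).
t_direct = t_from_summary(2.82, 1.47, 11, 3.36, 1.02, 36)
t_self_ideal = t_from_summary(2.18, 1.33, 11, 3.51, 1.27, 35)

print(round(t_direct, 2), round(t_self_ideal, 2))
```

Under this assumption the values come out near 1.14 and 2.92, close to but not identical with the reported 1.149 and 2.956; the remaining gap may reflect rounding in the published means and standard deviations or a different formula.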

To test Hypothesis 2 that feelings of ade- 
quacy will be greater in Ss who suffer fewer 
pressures toward heterosexual behavior and 
attitudes, the following operations were per- 
formed. Mean feelings of adequacy on both 
measures were calculated for: 

1. Ss who were rated as being in noncon- 
flictful jobs, ambiguous jobs, and conflictful 
jobs 

2. Ss expressing much satisfaction (rating 
of 1) with nonconflictful jobs, ambiguous 
jobs, conflict jobs, and for Ss expressing less 
satisfaction (ratings of 2, 3, and 4) with the 
three job categories 

3. Ss expressing preference for contact with 
homosexuals and actual leisure-time association 
predominantly with heterosexuals, preference 
for contact with homosexuals and 
actual leisure-time association predominantly 
with homosexuals, preference for contact with 
heterosexuals and actual leisure-time association 
predominantly with heterosexuals, and 
preference for contact with heterosexuals and 
actual leisure-time association predominantly 
with homosexuals 

4. Ss who had revealed or not revealed their 
status as homosexuals to work associates, 
friends, and family 

TABLE 4 

MEAN FEELINGS OF ADEQUACY FOR SUBJECTS EXPRESSING 
MUCH OR LESS SATISFACTION WITH NONCONFLICT 
JOBS, AMBIGUOUS JOBS, AND CONFLICT JOBS 

                      Satis. 
                      Rating    N     Mean    SD      t 
Direct Measure 
  Nonconflict Jobs    1         8     2.25    0.47 
                                                      3.564** 
                      2-4       11    3.73    1.35 
  Ambiguous Jobs      1         4     2.50    1.00 
                                                      2.547* 
                      2-4       13    3.85    0.68 
  Conflict Jobs       1         7     2.86    1.35 
                      2-4       4     3.25    0.95 
Self-Ideal Discrepancy 
  Nonconflict Jobs    1         7     2.71    0.95 
                                                      1.850 ns 
                      2-4       11    3.82    1.60 
  Ambiguous Jobs      1         5     1.80    0.84 
                                                      3.250** 
                      2-4             3.42 
  Conflict Jobs       1         6     2.67    0.81 
                                                      1.383 ns 
                      2-4       4     3.50    1.00 

* p < .05, two-tailed test. 
** p < .01, two-tailed test. 

No significant differences were found for 
Operations 1 and 4. In Operation 2, it will be 
seen (Table 4), that the only significant dif- 
ferences between the means of the various 
groups were obtained when comparing the 
satisfied Ss with the less satisfied Ss. Feelings 
of adequacy were then correlated with degrees 
of job satisfaction, and the obtained product- 
moment r’s of .479 (p< .01) for the self- 
ideal discrepancy, and .380 (p < .01) for the 
direct measure, led us to conclude that the 
more satisfied the homosexual male is with his 
job, the more likely he is to feel adequate. 

In operation 3 the hypothesized differ- 
ences were not obtained. Instead it appears 
(Table 5) that the homosexual males who 
prefer leisure-time contact with heterosexuals 
tend to feel more adequate than those who 
prefer homosexual associations, regardless of 
their actual pattern of contact. This result is 




significant for the self-ideal discrepancy, but 
not for the direct measure of self-adequacy. 


Results Relating to Subjective Role Conflict 


To test Hypothesis 3 that feelings of ade- 
quacy will be greater in those Ss who identify 
with the role of the typical homosexual male, 
adequacy scores on both measures were cor- 
related with the self-typical homosexual male 
discrepancy. The prediction was made that 
the smaller this discrepancy, the greater would 
be S’s feelings of adequacy, and hence a 
smaller self-adequacy score would be ob- 
tained. The obtained correlations (.071 and 
—.130), however, did not support the hy- 
pothesis. 

To test Hypothesis 4 that feelings of adequacy 
will be greater in those Ss who perceive 
fewer desirable characteristics in the role of 
the typical heterosexual male, adequacy scores 
on both measures were correlated with the 
ideal-heterosexual male discrepancy. The prediction 
was made that the larger this discrepancy, 
the greater would be S’s feelings of adequacy. 
The obtained correlations were .320 
(p < .02) for the self-ideal discrepancy, and 
.243 (p < .05) for the direct measure. Contrary 
to prediction, it thus appears that the 
Ss who perceive more desirable characteristics 
in the role of the typical heterosexual male 
tend to feel more adequate. 

TABLE 5 

MEAN FEELINGS OF ADEQUACY FOR SUBJECTS WHO 
PREFER CONTACT WITH HOMOSEXUALS OR HETEROSEXUALS 
AND WHO ACTUALLY ASSOCIATE PREDOMINANTLY 
WITH HETEROSEXUALS OR HOMOSEXUALS 

                                    N     Mean    SD      t 
Direct Measure 
  Prefer Homosexuals, 
  Associate with Homosexuals 
                                                          0.825 
  Prefer Heterosexuals, 
  Associate with Homosexuals              2.80    1.30 

  Prefer Homosexuals, 
  Associate with Heterosexuals      8     3.62    1.45 
                                                          1.430 
  Prefer Heterosexuals, 
  Associate with Heterosexuals      5     2.60    0.89 
Self-Ideal Discrepancy 
  Prefer Homosexuals, 
  Associate with Homosexuals        24    3.42    1.50 
                                                          2.040 
  Prefer Heterosexuals, 
  Associate with Homosexuals        5     2.40    0.89 

  Prefer Homosexuals, 
  Associate with Heterosexuals            3.75    1.28 
                                                          3.305* 
  Prefer Heterosexuals, 
  Associate with Heterosexuals      5     1.20    0.84 

Note. Total N = 47 for direct measure and 46 for self-ideal 
discrepancy. Four Ss were excluded from the analysis for 
stating they spent equal amounts of time with heterosexuals 
and homosexuals. 

* p < .01, two-tailed test. 

To test Hypothesis 5 that feelings of ade- 
quacy will be greater in those Ss who perceive 
a smaller discrepancy between themselves and 
the typical homosexual male than between 
themselves and the typical heterosexual male, 
adequacy scores were correlated with the dif- 
ference between the self-heterosexual male 
discrepancy and the self-homosexual male dis- 
crepancy. A negative correlation was thus pre- 
dicted. Although only one measure of ade- 
quacy reached a satisfactory level of signifi- 
cance, both correlations were in the opposite 
direction to that originally hypothesized: .249 
(p < .05) for the self-ideal discrepancy, and 
.149 for the direct measure. So it appears that, 
if anything, feelings of adequacy are greatest 
in these Ss if they see themselves as more like 
the typical heterosexual male than like the 
typical homosexual male. 


DISCUSSION 


Assuming that self-adequacy has been val- 
idly assessed by either or both methods, it 
seems that the homosexual males feel ade- 
quate if they act in conformance with prevail- 
ing cultural standards regarding the desir- 
ability of adhering to the prescribed sex role. 
They are not only more self-adequate if they 
perceive themselves as being closer to the 
typical heterosexual male than to the typical 
homosexual male, but they also feel more ade- 
quate if they admire the characteristics and 
traits ascribed to the heterosexual male. Re- 
gardless of actual patterns of association, the 
adequate homosexual male prefers to be with 
heterosexuals in his leisure time rather than 
with other homosexuals. Although job satis- 
faction is positively related to feelings of ade- 
quacy in these individuals, this finding is 
probably not unique to homosexuals, but applies 
to other persons as well. In fact, the only 
finding contradictory to this reasoning is that 
homosexually married Ss were found to feel 
more adequate than the nonmarried and this 
result can possibly be explained by regarding 
the inadequate homosexual male as one who 
is incapable of entering such an intimate re- 
lationship. The adequate homosexual male, on 
the other hand, is able to endure the more or 
less permanent company of another homo- 
sexual—especially if the latter also feels self- 
adequate—and thus is more apt to marry 
homosexually than is the inadequate homo- 
sexual male. 

Brown’s distinction (1957) between inver- 
sion and homosexuality may conceivably be 
useful in further describing the results of this 
study. He characterizes the homosexual as an 
individual who “desires and/or obtains pre- 
dominant or exclusive sexual satisfaction with 
members of his own sex” (p. 614). Moreover, 
the homosexual also probably admires and 
identifies with the typical heterosexual male. 
The invert, however, is not only very likely 
to prefer this type of sexual gratification, but 
also “is one whose thoughts, perceptions, atti- 
tudes, fantasies, feelings, preferences, inter- 
ests, and behavioral tendencies are typical of 
the opposite sex” (p. 614). 

If the Ss of this study can be said to in- 
clude both inverts and homosexuals in Brown’s 
sense, and if the inadequate individuals do 
feel themselves to be more like the typical 
homosexual male (which role, as has been 
found, is more like the typical heterosexual 
female than like the typical heterosexual 
male), we may tentatively assume that these 
inadequate Ss tend to be like Brown’s “in-
verts.” Also, since the adequate homosexuals
tend to admire the traits and characteristics 
of the typical heterosexual male and feel they 
are closer to this role than to the typical 
homosexual male, it may further be inferred
that these persons are most like Brown’s
“homosexuals.”

Newcomb’s comments (1950) are felt to
give even more pertinent support and clarifi- 
cation to this study’s findings. He states that 
the individual who finds the prescribed sex 
role uncongenial is subject to “threat and in- 
security” because deviation from this role “is 
readily detectable and therefore readily pun-
ishable” (p. 418). “Deviancy,” in this study,
then takes on two meanings: “deviancy” in
the sense that all Ss have departed from the 
cultural norm of preferring a heterosexual fe- 
male for a sex partner; and “deviancy” in the 
sense that some Ss have also chosen to reject 
the prescribed sex role—that of the typical 
heterosexual male. 

Since a comparison of mean self-adequacy 
scores on the direct measure for this homo- 
sexual male sample vs. a random sample of 
male students at the University of Colorado 
had yielded no statistically significant differ- 
ence, we are led to conclude that the former 
meaning does not necessarily lead to feelings 
of inadequacy, while the latter meaning does, 
since this research has finally found that those 
homosexual males who see the prescribed sex 
role as uncongenial are those who are also in- 
adequate. But those homosexual males who 
do adhere to the cultural standards of feel- 
ing, perceiving, emulating, and idealizing the 
typical heterosexual male are more likely to 
feel self-satisfied and adequate. 

Of course, all of these tentative interpreta- 
tions must be dealt with cautiously. The sam- 
ple used in this study was in no way ran- 
domly selected, and the generalizability of the 
results to the total homosexual male popula- 
tion is, therefore, questionable. 


SUMMARY 


This research was a study of feelings of 
adequacy in homosexual males. Designed to 
test a set of simple hypotheses, the study 
focused on objective and subjective role con- 
flict as possible factors relevant to homo- 
sexual males’ feeling of adequacy. 

Forty-seven homosexual males were anony-
mously assessed by means of paper-and-pencil
questionnaires. Feelings of adequacy were
measured by: the discrepancy between the
self- and the ideal-self-concepts on a list of
46 traits deemed relevant to the homosexual
male’s status; and responses to a list of 20
MMPI-type statements presumably reflecting
feelings of adequacy. The correlation between
these two measures of adequacy was .53. In-
formation relative to objective role conflict
was obtained through answers to questions
regarding the subject’s status from which one
could infer pressures toward heterosexual be-
havior and attitudes. Subjective role conflict
was assessed by means of ratings assigned to
the same 46 traits as were utilized in the self-
ideal discrepancy, with separate ratings ob-
tained for the “typical homosexual male,” the
“typical heterosexual female,” and the “typi-
cal heterosexual male.”

The University of Colorado cross section was
studied by Robert Kassebaum and Leon Rappaport,
under the direction of William A. Scott.

The findings did not, in general, support 
the role conflict hypotheses. Instead, the re- 
sults could more readily be interpreted as in- 
dicating that subjectively adequate homo- 
sexual males were those who tended to identify 
with the masculine norms of the dominant 
culture. Feelings of adequacy were associated 
with: job satisfaction, preference for leisure- 
time association with heterosexuals, idealiza- 
tion of the role of the typical heterosexual 
male, and identification with the typical het- 
erosexual male rather than with the typical 
homosexual male. 

Since this was not a random sample of
homosexual males, it is recommended that
caution be used in generalizing from these
results.


REFERENCES 


Bennett, E. H. The social aspects of homosexuality.
Med. Pract., 1947, 217, 207-210.

Brown, D. G. The development of sex-role inversion
and homosexuality. J. Pediat., 1957, 50, 613-619.

Hooker, Evelyn. A preliminary analysis of group
behavior of homosexuals. J. Psychol., 1956, 42,
217-225.

Hooker, Evelyn. The adjustment of the male overt
homosexual. J. proj. Tech., 1957, 21, 18-31.

Newcomb, T. M. Social psychology. New York:
Dryden, 1950.

Parsons, T. Age and sex in the social structure of
the United States. In C. Kluckhohn & H. A. Mur-
ray (Eds.), Personality in nature, society, and
culture. New York: Knopf, 1956. Pp. 363-375.


(Received February , 1960) 





Journal of Consulting Psychology
1961, Vol 5
12 128


A SECOND VALIDATION OF A LONG-TERM 
RORSCHACH PROGNOSTIC INDEX FOR 
SCHIZOPHRENIC PATIENTS¹


ZYGMUNT A. PIOTROWSKI AND BARRY BRICKLIN


Jefferson Medical College of Philadelphia 


In 1952 a group of Rorschach prognostic
signs was presented with which the follow-up
conditions of schizophrenic patients could be
predicted (Piotrowski & Lewis, 1952). An es-
sentially postdictive methodology was used,
i.e., the prognostic signs were stated after the 
follow-up groupings had been formed. In 
1958 these signs were applied to 30 schizo- 
phrenic patients, some of whom improved 
over an interval of at least 3 years and some 
of whom remained unimproved over the same 
interval. On the basis of this application the 
signs were revised (in order to increase reli- 
ability) resulting in the 1958 prognostic in- 
dex. The 1958 prognostic index was then vali- 
dated on a group of 70 schizophrenics (Pio- 
trowski & Bricklin, 1958). The purpose of the 
current investigation was to test the validity 
of the index on a group of patients who dif- 
fered in many important respects from the 70 
used in 1958. The prognostic index was again 
revised slightly, and was revalidated on 103 
additional schizophrenic patients (the 1959 
group). This slightly revised version was then 
reapplied to the 1958 sample so as to facili- 
tate comparisons between the two groups. It 
should be kept in mind that the 1959 re- 
vision did not change the cutoff point, and 
the relative distribution of the 1958 cases has 
not changed in any way, i.e., the two samples 
constitute independent validating evidence for 
the prognostic index. 


1 The authors wish to thank Clellen Morgan and 
Robert Ballard of the Veterans Administration Re- 
gional Office, Philadelphia, and David Cohen of the 
Veterans Administration Hospital, Coatesville, Penn- 
sylvania, for their invaluable assistance in making 
this investigation feasible. This study was supported 
by a grant from the Research Committee of the Su-
preme Council, thirty-third degree, Scottish Rite
Freemasonry, Northern Masonic Jurisdiction.


METHODOLOGY 


The essential validating methodology in both the
1958 and the 1959 samples was to compare predic-
tions made on the basis of the Rorschach prognostic
index against follow-up statements made on the basis
of independent clinical judgments.

In the selection of cases, a rule of inclusion was
that there be copious follow-up data on each schizo-
phrenic patient, consisting of psychiatric interviews,
psychiatric social worker interviews, interviews of
the patient’s family members by psychiatric social
workers, and staff conference reports. This informa-
tion had to extend at least 3 years past the time at
which a Rorschach test had been administered. It
was to this Rorschach test that the prognostic index
was applied. The actual year in which the Rorschach
had been given was unimportant so long as there
was follow-up data at least 3 years subsequent to
this year. The other standard of inclusion was that
each patient be independently diagnosed as schizo-
phrenic.

The first step in the procedure was to search
through the psychological test files and locate all
schizophrenic cases to whom Rorschach tests had
been administered at least 3 years ago. The second
step was to see if there was extensive follow-up data
on all such cases which extended at least 3 years
subsequent to the time of Rorschach testing. The
third step was to insure that each case had main-
tained the diagnosis of schizophrenia over the entire
interval for which information was available. The
same clinicians who made the follow-up designations
(see below) made all decisions to include or exclude
cases. All such decisions were made without knowl-
edge of Rorschach results. Only one-fourth of the
Rorschach records taken at least 3 years earlier could
be used. The remaining three-fourths of the cases did
not have adequate follow-up information.

The methodology consisted of comparing predic-
tions made on the basis of the Rorschach prognostic
index against follow-up statements as designated by
experienced clinicians working independently, using
the clinical and life history data enumerated above
(Lewis & Piotrowski). On the basis of this follow-
up information each patient was designated as im-
proved or unimproved over the x-year interval,
where x was at least 3. The prognostic index was
applied to the Rorschach tests each one of which
had been administered at least 3 years prior to the
time to which the follow-up extended. All Rorschach
tests were identified by number only; the rater
(Bricklin) had no other information at his disposal.
This design eliminated the possibility of contaminat- 
ing factors. The rater who applied the prognostic 
index had no data as to the follow-up conditions of 
the patients; the clinicians who made the follow-up 
designations had no Rorschach data. A prediction 
(improved, unimproved) based on the Rorschach 
prognostic index was made for each case. A score of 
+2 or more would predict the patient to remain un- 
improved; +1 or less would indicate the patient to 
be improved. These predictions were then confronted 
with the independent follow-up designations (im-
proved, unimproved) and the two results were com-
pared by the chi square technique.

The follow-up length (at least 3 years) was used 
not only in order to make possible a meaningful 
statement as to follow-up condition but to insure 
the correctness of the initial diagnosis. Since our 
methodology demanded that there be extensive fol-
low-up data on each patient, the chance of our hav-
ing included a nonschizophrenic patient was, for all
intents and purposes, ruled out.

The 1958 and the 1959 samples of schizophrenic 
patients were chosen so as to differ from each other 
in age, intelligence, sex composition, socioeconomic 
status, and duration of time elapsed between onset 
of manifest psychosis and Rorschach examination 
time. The purpose was to validate the prognostic 
index on as wide a range of schizophrenic patients 
as possible. 
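
The scoring rule described above (a score of +2 or more predicts "unimproved"; +1 or less predicts "improved") can be sketched in a few lines of Python. This is an illustration only: the function name `predict_outcome` and the example scores are hypothetical, and the index scores are taken as given, since the signs and their weights are defined elsewhere (Piotrowski & Bricklin, 1958).

```python
# Cutoff taken from the text: +2 or more predicts "unimproved".
CUTOFF = 2


def predict_outcome(index_score: int) -> str:
    """Return the prognostic prediction for one Rorschach index score."""
    return "unimproved" if index_score >= CUTOFF else "improved"


# Hypothetical scores, for illustration only (not data from the study).
example_scores = [0, 1, 2, 5]
predictions = [predict_outcome(score) for score in example_scores]
```

Keeping the cutoff in one named constant mirrors the design of the study, in which the same cutoff was applied to both samples.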




Follow-Up Designations 


The same clinical follow-up criteria of improve- 
ment and unimprovement were used in both the 
1958 and the 1959 groups. Every available source of 
information—psychiatric interviews and evaluations 
plus psychiatric social work reports, family reports 
on patients’ adjustment and behavior, and staff con- 
ference notes—was scrutinized. The patient had to 
improve in all three of the following areas to be
designated as improved. If he failed to improve in
all three areas or grew worse, he was classified as 
unimproved. The three areas are: 

1. Thought processes. The relevance, coherence,
sense of reality, comprehensiveness, consistency, con-
fidence, and valid self-criticism in making state-
ments, were considered.

2. Psychosocial relations and work. The capacity
of the patient to form meaningful emotional rela-
tions with others was considered. The employment
record of each patient was analyzed from two stand-
points: as an additional check on his capacity to live
with others, and to yield information on his ability
to do some kind of work reasonably effectively. On
hospitalized patients the hospital work history was
considered. The patient had to display an increased
capacity for productive work and for meaningful
and more constructive interhuman relations to be
designated as improved in this area.


TABLE 1

POPULATION CHARACTERISTICS OF 1958 GROUP AND 1959 GROUP

                          1958 Group         1959 Group
Variables                  (N = 70)           (N = 103)
Being
Compared                 Mean     SD        Mean     SD

Age                       28      8.4        34      7.2
IQ                       118     17          97     15.1
Follow-up
Interval                   6.0    3.7         6.5    2.8



3. Attitude towards self. This refers to the degree 
of anxiety and to the degree of self-acceptance. To 
be designated as improved in this area, the patient 
had to be more comfortable with himself, and had 
to give evidence of feeling in a realistic way that his 
life had become less troublesome and less difficult. 
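
The designation rule stated above (improvement in all three areas is required) can be written out as a short sketch. The class and field names below are hypothetical labels for the three areas; the clinical judgment behind each true/false value is, of course, not captured by the code.

```python
from dataclasses import dataclass


@dataclass
class FollowUpJudgment:
    """Clinician's judgment for one patient; field names are illustrative."""
    thought_processes_improved: bool
    psychosocial_relations_and_work_improved: bool
    attitude_towards_self_improved: bool


def designate(judgment: FollowUpJudgment) -> str:
    """A patient is 'improved' only if improved in all three areas."""
    improved_everywhere = all([
        judgment.thought_processes_improved,
        judgment.psychosocial_relations_and_work_improved,
        judgment.attitude_towards_self_improved,
    ])
    return "improved" if improved_everywhere else "unimproved"


# Hypothetical example: improvement in only two of the three areas.
partial = FollowUpJudgment(True, True, False)
status = designate(partial)
```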

It is important that two points be kept in mind
when follow-up designations are to be made. (a)
Some symptoms of schizophrenia generally persist
even in improved cases. There is generally some de-
gree of affective blunting. Some traces of delusional
thinking generally persist even in improved cases.
(b) When dealing with one of those patients termed
“episodic” by Bleuler (1950) it is important not to
consider daily (or even hourly) fluctuation in condi-
tion as essential change. This is especially true of
manic or depressive mood phases. These mood phases
are almost always transient. An inspection of the en-
tire duration of the follow-up interval must be made
until its trend is revealed. With episodic patients the
frequency of lucid intervals, and the degree of de-
fect shown in the lucid intervals, become the decid-
ing criteria.


1958 Sample


It had been possible to isolate from our psycho-
logical test files 70 cases which met the conditions of
inclusion, i.e., patient diagnosed schizophrenic, Ror-
schach test available, follow-up information extend-
ing at least 3 years subsequent to Rorschach, diag-
nosis of schizophrenia maintained over entire in-
terval for which information was available. The
condition most difficult to meet in this group as
well as in the 1959 group was that of obtaining huge
masses of follow-up information so that an accurate
appraisal as to the trend of each patient’s illness
could be made.

As shown in Table 1, the average intellectual level
was above average in the 1958 group, with a mean
Wechsler-Bellevue IQ of 118, the standard deviation
being 17. Nearly all of the patients were first admis-
sions and the Rorschach tests to which the signs
were applied had been administered 2 months to 2
years after the onset of manifest psychoses. The dura-
tion of this particular interval was gathered from the
follow-up sources listed above and did not neces-
sarily correspond to the duration of hospitalization.
The mean age of the 1958 group was 28 years, the
standard deviation being 8.4 years. All of the pa-
tients in this group belonged to the middle and high
middle socioeconomic classes. There were 29 men and
41 women in this sample. The mean follow-up in-
terval (time elapsed between that point at which
the Rorschach was administered and the time to
which the patient was followed) was 6.0 years, the
standard deviation 3.7 years.


1959 Sample 


All patients met the conditions of inclusion as out-
lined previously. These patients were chosen so as to
differ in the above mentioned respects from the 1958
group. To satisfy these requirements the patients
were selected from a Veterans Administration Re-
gional Office (53 patients) in Philadelphia and from
the Veterans Administration Hospital (50 patients)
at Coatesville, Pennsylvania. The 103 veterans in
this group had become manifestly psychotic from 2
to 10 years prior to the time at which they were
administered the Rorschach test. At the time of ex-
amination their mean Wechsler-Bellevue IQ was 97,
the standard deviation 15.1, as is shown in Table 1.
The mean age was 34 years, the standard deviation
7.2. The 1959 group contained 97 men and 6 women.
The differences between the 1959 group and the 1958
group in IQ and age are statistically significant (p <
.01) as are the differences in interval between onset
of manifest psychosis and psychological examination,
and sex composition. Practically all members of the
1959 sample belonged to a socioeconomic class below
the middle class. The mean follow-up interval in
this group was 6.5 years, the standard deviation 2.8.
The durations of times over which the patients were
followed subsequent to their Rorschach tests did not
differ significantly in the two groups.


RESULTS 
Validation 


1958 Sample. There was one disagreement
between the two clinicians as to follow-up desig-
nations. This difference was reconciled in open
conference until one decision was reached for 
the case (for design purposes). 

TABLE 2

CHI SQUARE ANALYSIS OF PROGNOSTIC INDEX
SCORES OBTAINED IN 1958 GROUP

                       Follow-up Status

Index Scores      Improved   Unimproved   Total

2 and more            4          45         49
1 and less           18           3         21

Total                22          48         70

df = 1, χ² = 41.03, p < .01


It had been hypothesized that high index
scores would be associated with failure to im-
prove, and low index scores with improve-
ment. On the basis of the previous investiga-
tions, +2 and more was chosen as the cutoff
point that would distinguish the unimproved
patients from the improved patients. In the
1958 validation group (Piotrowski & Bricklin,
1958), 49 patients obtained scores of at least
+2 points; 45 of them were worse or did not
change during the follow-up period, i.e., 45
of these cases had been independently desig- 
nated as unimproved. Of the 21 patients with 
scores of less than +2 points, 18 were inde- 
pendently designated as improved. The prog- 
nostic index correctly predicted 90% of the 
70 patients’ follow-up conditions. As may be 
noted on Table 2, the chi square value of 
41.03 (df= 1) indicates that such results 
could occur by chance less than 1 in 100 
times. 
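
As a worked check on the figures above, the following Python sketch computes a Pearson chi square for the four-cell table of the 1958 group, together with the proportion of correct predictions. It assumes the uncorrected formula (no continuity correction); the report does not state which variant was used. The function name `chi_square_2x2` is illustrative.

```python
def chi_square_2x2(a: int, b: int, c: int, d: int) -> float:
    """Pearson chi square (no continuity correction) for the table
    [[a, b], [c, d]], using N(ad - bc)^2 / (row and column totals)."""
    n = a + b + c + d
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    if denominator == 0:
        raise ValueError("every row and column total must be nonzero")
    return n * (a * d - b * c) ** 2 / denominator


# Counts for the 1958 group as given in the text.
# Rows: index score (+2 and more, +1 and less).
# Columns: follow-up status (improved, unimproved).
a, b = 4, 45    # 49 high scorers, 45 of them unimproved
c, d = 18, 3    # 21 low scorers, 18 of them improved

chi2 = chi_square_2x2(a, b, c, d)
# Correct predictions: high score and unimproved, or low score and improved.
hit_rate = (b + c) / (a + b + c + d)
```

Under this assumption the statistic comes out at about 41.0, in line with the reported value, and the hit rate is 63 of 70, or 90%.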

1959 Sample. There were two disagreements 
between the two clinicians as to follow-up 
designations. As above, these differences were 
reconciled in open conference until one de- 
cision was reached for each case. Seventy- 
two schizophrenic patients attained scores on 
the prognostic index of at least +2 points; 
70 of these patients had been independently 
designated as unimproved. Thirty-one pa- 
tients attained index scores of less than +2 
points; 22 of these patients had been desig- 
nated as improved. The chi square value of 
56.66 (df = 1) indicates that such an asso- 
ciation between the index scores and follow-up 
conditions could have occurred by chance less 
than 1 in 100 times. Thus the follow-up con- 
ditions of 89% of the patients had been cor- 
rectly predicted by the prognostic index in 
the 1959 group (see Table 3). 


TABLE 3

CHI SQUARE ANALYSIS OF PROGNOSTIC INDEX
SCORES OBTAINED IN 1959 GROUP

                       Follow-up Status

Index Scores      Improved   Unimproved   Total

2 and more            2          70         72
1 and less           22           9         31

Total                24          79        103

df = 1, χ² = 56.66, p < .01


Comparison of Results


Minor changes have been made in the pres-
entation of the 1959 data (from the 1958
data) in the grouping of several signs and in
the weightings of two. Signs 3 and 6, and
Signs 5 and 7, were combined because of
the relatively low frequency of occurrence of
Signs 6 and 7. The weightings of Signs 2 and 
combined 3 & 6 were lowered from 3 points 
to 2 points. This revised index was applied 
to both the 1958 and the 1959 validation 
groups in order to make the results directly 
comparable. Neither the original validity of 
the single signs nor of the original prognostic 
conclusions in the 1958 group was affected in 
any way. The same cutoff point of +2 was 
used with both groups. The weighting changes 
did not affect the relative distribution of 
cases in the 1958 sample. 

As can be seen, the prognostic index suc-
cessfully predicted the follow-up conditions of
90% of the patients in the 1958 group, and
89% of those in the 1959 veteran group.
The variables in which the two validation
groups differed—intelligence, age, etc.—did
not affect the validity of the prognostic in- 
dex. There is a tendency among the veterans 
(the 1959 group) to show a somewhat lower 
incidence of improvement in the low score 
group. In the 1959 veteran group, of the 31 
persons obtaining a prognostic index score of 
1 or less, 71% actually were improved. In 
the 1958 group, of the 21 patients obtaining 
the same score, 86% improved. This finding, 
which may be related to differential treat- 
ment procedures, remains to be investigated. 
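
The percentages quoted in this comparison follow directly from the counts reported for each group. The short Python sketch below recomputes them; the dictionary layout and the helper `pct` are illustrative choices, not part of the original analysis.

```python
# Counts taken from the text for each validation group.
groups = {
    "1958": {"n": 70, "correct": 45 + 18, "low_n": 21, "low_improved": 18},
    "1959": {"n": 103, "correct": 70 + 22, "low_n": 31, "low_improved": 22},
}


def pct(part: int, whole: int) -> int:
    """Percentage rounded to the nearest whole number."""
    return round(100 * part / whole)


summary = {
    name: {
        # Share of all patients whose follow-up status was predicted correctly.
        "hit_rate_pct": pct(g["correct"], g["n"]),
        # Share of low scorers (+1 or less) who actually improved.
        "low_score_improved_pct": pct(g["low_improved"], g["low_n"]),
    }
    for name, g in groups.items()
}
```

The overall hit rates are nearly identical in the two groups, while the rate of improvement among low scorers differs (86% against 71%), which is the discrepancy the text leaves for further investigation.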

The 1959 veteran group was composed of 
ambulatory or milder VARO cases (N = 53), 
and more severe VAH cases (N = 50). The 
1958 group (N = 70) falls between these two
other groups in terms of severity of illness; 
these patients were hospitalized but were 
early and mild cases at the time. It is inter- 
esting to note that the mean prognostic index 
scores were 2.8 (SD = 3.3) in the VARO
cases; 3.9 (SD = 4.3) in the 1958 group;
and 5.8 (SD = 4.0) in the VAH group. The
mean prognostic index scores reflect the in-
creasing severity of defect in the three groups.
The difference in the prognostic index scores
of VARO and VAH cases is significant at the
p < .01 level; that between the 1958 group
and the VAH cases at p < .02. The difference
in scores between the 1958 group and the
VARO cases falls at the p < .10 level (t
tests).

Two of the signs, 3 & 6 combined and 4, 
were more frequent in the 1958 group. Thus 
it is possible that these signs are related to 
intelligence. The appearance of these signs 
apparently requires on the part of the pa- 
tient a critical attitude toward thinking and 
some facility in verbalizing thoughts, with the 
concomitant condition that these thoughts and 
their evaluations be defective. However, the 
habit itself of thinking and speaking in terms 
of probabilities rather than certainties (“could 
be, I don’t know”; “might be anything that 
has a shape”; etc.) is correlated with intelli- 
gence and therefore shows up to a greater 
extent in the brighter 1958 group. Signs 11 
(determinant scarcity) and 12 (content mo-
notony) were more frequent in the less bright
1959 veteran group. This would be expected.
As a rule, the variety of Rorschach compo-
nents decreases with decreasing intelligence.
By retaining signs which appear with differ-
ing frequency in varying populations we are
able to apply the same set of signs to differ-
ent types of schizophrenics.


TABLE 4

THE PROGNOSTIC SIGNS, THEIR WEIGHTING
AND CHI SQUARE VALIDITY

N = 173

[Table body (the 14 signs with their weightings and
chi square values) is not legible in the source.]

The weightings are given in parentheses. For a more de-
tailed description of each sign, including examples of each, the
reader is referred to Piotrowski and Bricklin (1958).


The prognostic validity of each sign was 
measured by the chi square technique. This 
was done for the 1958 and the 1959 groups 
separately as well as for the entire sample. 
Using the entire sample of 173 cases, a four- 
cell contingency table was formed: the im- 
proved and unimproved patients formed one 
dimension, those manifesting and those not 
manifesting the sign the other dimension. 
The results are given in Table 4. The fol-
lowing signs differentiated between the im-
proved and unimproved cases at the p < .01
level: 1, 5 & 7, 9, 10, 11, 12, 13, and 14. 
Three signs differentiated at the p< .02 
level: 3 & 6, 4, and 8. Sign 2 differentiated 
at the p < .05 level. 
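
The per-sign analysis described above can be sketched as follows: each patient contributes one (sign present, improved) pair, the pairs are tallied into a four-cell table, and a chi square is computed for that table. The records below are hypothetical and serve only to show the mechanics; the uncorrected Pearson formula is an assumption, since the report does not say whether a continuity correction was applied.

```python
from collections import Counter


def sign_table(records):
    """Tally (has_sign, improved) pairs into the four cells a, b, c, d."""
    counts = Counter(records)
    a = counts[(True, True)]     # sign present, improved
    b = counts[(True, False)]    # sign present, unimproved
    c = counts[(False, True)]    # sign absent, improved
    d = counts[(False, False)]   # sign absent, unimproved
    return a, b, c, d


def chi_square_2x2(a, b, c, d):
    """Pearson chi square without continuity correction (df = 1)."""
    n = a + b + c + d
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    if denominator == 0:
        raise ValueError("every row and column total must be nonzero")
    return n * (a * d - b * c) ** 2 / denominator


# Hypothetical records for one sign (not data from the study).
records = (
    [(True, False)] * 30 + [(True, True)] * 5
    + [(False, True)] * 20 + [(False, False)] * 10
)
table = sign_table(records)
chi2 = chi_square_2x2(*table)

# Critical value of chi square for df = 1 at the .01 level.
CRITICAL_01 = 6.635
significant_at_01 = chi2 > CRITICAL_01
```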


Reliability 


The reliability of the prognostic index was 
tested independently of the main study by 
having five raters apply the index to each of 
10 schizophrenic cases, and by having an- 
other rater and one of us (Bricklin) inde- 
pendently apply the index to 25 schizophrenic 
cases. 

Five raters were given seven unimproved 
and three improved schizophrenic Rorschach 
records. These records were chosen at ran- 
dom from among improved and unimproved 
cases in proportion to the rate at which im- 
proved and unimproved schizophrenic cases 
appear in the general schizophrenic popula- 
tion. The raters, of course, had no knowledge 
of this decision rule. 

There was no disagreement in the case of 
eight patients whom all five raters placed in 
the same group, improved or unimproved. 
One rater disagreed with the rest of them by 
placing one unimproved patient among the 
improved; and one rater disagreed with his 
fellows by placing an improved patient among
the unimproved. Out of a total of 50 predic-
tions, only 2 were incorrect as to prognostic
conclusion. On four patients, all five raters
had identical prognostic index scores. On four
other patients the greatest difference in scores
among all five raters was only two points. On
the remaining two patients, the highest and
lowest scores differed by four points; how-
ever, this difference was critical in only one
case by leading to a prognostic conclusion op-
posite to that of the majority of raters.

2 Signs 3, 5, 6, and 7 were all individually valid at
at least the p < .05 level, as they are in combination.

In addition, one other rater (Carter Zelez- 
nik) applied the index to 18 unimproved and 
7 improved cases chosen in the same manner 
as above. These same cases were independ- 
ently scored by one of us. The prognostic in- 
dex scores differed by two points in four 
cases, and by one point in one case. In only 
one case, however, did the difference (of two 
points) influence the prognostic conclusion. 
The reliability can be considered satisfactory. 
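
The two reliability checks can be summarized as simple proportions. The sketch below uses only the counts given in the text; the function name `agreement_rate` is illustrative, and a simple proportion is only one of several possible reliability summaries.

```python
def agreement_rate(total: int, discrepant: int) -> float:
    """Proportion of prognostic conclusions that were not discrepant."""
    if total <= 0:
        raise ValueError("total must be positive")
    return (total - discrepant) / total


# Five raters x ten cases = 50 predictions, 2 of them incorrect.
five_rater_rate = agreement_rate(5 * 10, 2)

# Two raters on 25 cases; the conclusions differed in 1 case.
two_rater_rate = agreement_rate(25, 1)
```

Both checks give the same proportion, 96%, although the first counts individual predictions and the second counts cases.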


DISCUSSION 


Long experience has shown, as Bleuler 
(1950) noted, that a schizophrenic frequently 
undergoes marked and unpredictable changes 
in personality during the first years after the 
onset of manifest psychosis. Such marked and 
unpredictable changes are rare 3 or more 
years after the onset of manifest psychosis. 
This factor makes long-term prognostic in- 
vestigations more realistic, at the present 
time, than short-term investigations. The 
rapid alternations of condition often so char- 
acteristic of the early years of schizophrenia 
render the problem of making accurate and 
meaningful follow-up statements most com- 
plex. There is a strong tendency for the 
eventual course of the illness to make itself 
known after the first 3 years. The frequency 
of essential changes in condition is exceed- 
ingly low after the first 3 years. Another 
factor which complicates short-term prog- 
nostic study is the differential effects of vari- 
ous treatment procedures. Psychotherapy and 
other therapeutic procedures are much more 
effective in the beginning of manifest psycho- 
sis than in later years when schizophrenics 
become much less responsive to environmen- 
tal influences, including therapy (Gottlieb & 
Huston, 1943). This decrease in personality 
variability with time favors long-term prog-
nostic research and greatly complicates short-
term prognostic studies. It may be mentioned
that, in this study, differential treatment pro- 
cedures did not seem to affect the validity of 
a long-term prognostic index prediction in 
any consistent manner. 

Other attempts to prognosticate the out- 
come of schizophrenia have taken many 
courses. Kantor (1953), among others, has 
differentiated so-called “process schizophren- 
ics” from “reactive schizophrenics,” the prog- 
nosis of the latter being more favorable than 
that of the former. The reactive cases are 
characterized by a normal prepsychotic per- 
sonality and an acute onset usually accom- 
panied by a “logical” precipitating factor. 
The reactive cases are also characterized by 
a clouded sensorium. The process types are 
best characterized as “thinking disorders” 
and generally have insidious onsets of dis- 
ease. In a system generally similar to this, 
Langfeldt (1937) has distinguished so-called 
“typical” (process) and “atypical” (reactive) 
cases. 

One may also find in the literature many 
studies which list clinical symptoms or syn-
dromes of schizophrenia along with the per-
centage of improved cases associated with
each. 

Difficulty in using the above mentioned pro- 
cedures as prognostic tools has to do not with 
their essential validities, but with the diffi- 
culty of adapting them to the individual case. 
Many of these approaches depend on accu- 
rate and reliable case histories which are 
often impossible to obtain. It may also be 
noted that clinical signs and syndromes more 
often than not are highly variable in the in- 
dividual case. 


SUMMARY AND CONCLUSIONS 


In 1958 a Rorschach prognostic index for 
the prediction of a schizophrenic’s clinical 
condition (improved or unimproved) 3 or 
more years after testing was offered. The 
index had been validated on a group of 70 
followed-up patients (Piotrowski & Bricklin,
1958). The present report describes the sec-
ond validation of the index on a group of 103
schizophrenic patients, differing from the first 
patient group in many ways, including aver- 
age intelligence, age, severity of illness, and 
distributions of sexes. The results were virtu- 
ally the same. In 90% of the cases in the first 
group, and in 89% of the cases in the second 
group, the prognostic index successfully pre- 
dicted the outcome conditions of the schizo- 
phrenic patients as either improved or unim- 
proved. 

Since the implications of a long-range prog- 
nostic index which validly differentiates be- 
tween schizophrenics who will be improved or 
unimproved are obviously serious, it is ad- 
visable to submit the prognostic index to ad- 
ditional tests. It must be remembered that 
the index applies to schizophrenics, and not 
to cerebral organic cases or psychoneurotics. 
The validity of the diagnosis of schizophrenia 
is an essential factor determining the degree 
of validity of the index. 


REFERENCES 

Bleuler, E. Dementia praecox or the group of
schizophrenias. New York: International Univer-
sities Press, 1950.

Gottlieb, J. S., & Huston, P. E. Treatment of
schizophrenia: Follow-up therapy in cases of in-
sulin shock therapy and in control cases. Arch.
Neurol. Psychiat., 1943, 49, 266-271.

Kantor, R. E., Wallner, J. M., & Winder, C. L.
Process and reactive schizophrenia. J. consult. Psy-
chol., 1953, 17, 157-162.

Langfeldt, G. The prognosis in schizophrenia and
the factors influencing the course of the disease.
Acta psychiat., 1937, Suppl. 13. (See also Suppl.
80, 110, 113.)

Piotrowski, Z. A., & Bricklin, B. A long-term
prognostic criterion for schizophrenics based on
Rorschach data. Psychiat. Quart. Suppl., 1958, 32,
315-329.

Piotrowski, Z. A., & Lewis, N. D. C. An experi-
mental criterion for the prognostication of the
status of schizophrenics after a three-year interval
based on Rorschach data. In P. Hoch & J. Zubin
(Eds.), Relation of psychological tests to psychi-
atry. New York: Grune & Stratton, 1952. Pp. 51-
72.

(Received February 25, 1960) 





Journal of Con 


1961, \ 


SOME EFFECTS OF STIMULUS VARIATION ON 
SPIRAL AFTEREFFECT IN ORGANIC AND 
NONORGANIC SUBJECTS’ 


RONALD M. SINDBERG
Duke University


Following the reports by Price and Deabler (1955) and Garrett, Price, and Deabler (1957) of studies in which perception of the negative spiral aftereffect (SAE) was found to discriminate with great accuracy between brain damaged patients and two nonorganic control groups, the clinical implications of this perceptual phenomenon have been studied by many different investigators. Some, such as Gilberstadt, Schein, and Rosen (1958) and Philbrick (1959), have carefully followed the methods of Price and Deabler (1955). Others, such as Gallese (1956), Davids, Goldenberg, and Laufer (1957), Spivak and Levine (1957), and Page, Rakita, Kaplan, and Smith (1957), have used somewhat different procedures and/or scoring systems. While the variations employed in these later studies make it difficult to compare their results precisely, at least one generalization appears to be warranted. While brain damaged subjects (Ss) as a group do tend to report the negative SAE less frequently than do either normal Ss or nonorganic psychiatric patients, the differences between groups are far less clear-cut than originally reported by Price and Deabler.

In trying to explain this discrepancy several investigators have suggested that the composition of the subject samples employed in different studies has been a crucial factor. In many cases the brain damaged Ss were older, more chronic patients than were the control groups studied. Location and type of brain damage are very probably other factors of importance. Both Gallese (1956) and Page et al. (1957) noted that many prefrontal lobotomy cases reported the SAE as readily as did normal Ss, and Aaronson (1958) has suggested that involvement of the temporal lobes is especially likely to eliminate the normal SAE response. Other investigators have raised the possibility that the apparent decrements shown by the brain injured may result chiefly from an inability to report the aftereffect rather than from failure to perceive it. Gallese (1956) attempted to deal with this problem by using more directive and probing instructions than did Price and Deabler (1955), while at the same time he revised his scoring procedures to reduce the likelihood that reticence or difficulty in report would contribute to low scores.

In the studies cited above, there have been only minor variations in the stimulus conditions under which the spiral was presented. Although the conditions utilized by Price and Deabler (1955) seem to be close to optimal for perception of the SAE by normal Ss, no attempt has been made to establish the optimal conditions for its perception by the brain injured or for differentiating between brain damaged and nonorganic Ss. The present study was designed to investigate the effects of varying concomitantly two easily controlled and readily quantified stimulus conditions, which, from studies of other aftereffect and apparent motion phenomena (Hammer, 1949; Teuber & Bender, 1949), seemed


1 Based on a dissertation submitted in partial fulfillment for the doctor of philosophy degree in the Department of Psychology, Duke University, Durham, North Carolina. The author wishes to thank Gregory Kimble and the other staff members at Duke University, Butner (North Carolina) State Hospital, and Veterans Administration Hospitals at Richmond, Virginia, Fayetteville, North Carolina, and Durham, North Carolina, who cooperated in this study.

2 Now at Laboratory of Neurophysiology, Department of Physiology, University of Wisconsin Medical School, Madison, Wisconsin.





Ronald M. Sindberg 


likely to be of significance in determining aftereffect perception. These variables are the speed at which the spiral is rotated and the length of time for which S observes the rotating spiral. In setting up this study an attempt was made to control as many as possible of the factors which have been suggested above as possible explanations for the range of results found by other investigators.


METHOD


Subjects. Three groups of Ss were included in this study.

1. The Cortical Damage Group consisted of 48 male and 2 female patients considered by the physicians in charge of their respective cases to have demonstrable organic cortical damage. Many of them, of course, also had damage at subcortical levels. These patients were recommended for this study by their case doctors or other interested staff members, as not too severely aphasic or otherwise distressed to be able to cooperate in the experiment. All of the available information concerning the patient’s brain damage was gathered from his clinical chart and from a short conference with his case physician, usually a resident in neurology or neurosurgery. The diagnoses listed for these patients at the time of testing are given in Table 1.

2. The Neuropsychiatric (NP) Control Group consisted of 42 male and female patients with no known or suspected organic brain pathology, who were referred for routine psychological testing. The older patients in this group (60 years and older) were specifically recommended for this study by their physicians as showing no clinical signs of organic brain damage. Diagnoses of this group are also listed in Table 1. The patient groups were comparable in age and educational level, as indicated in Table 1. All patients in this study had been hospitalized for 6 months or less, although for many of them this was not a first admission.

3. The College Student Group consisted of 35 female and 15 male undergraduates who were required to take part in a number of experiments to fulfill their Introductory Psychology course requirements. This group was, of course, considerably younger and better educated than the patient groups, and unhospitalized.

A total of 2 brain injured and 6 psychiatric patients had to be eliminated from the study because they were unable or unwilling to cooperate.


TABLE 1

DESCRIPTION OF PATIENTS STUDIED

Cortical Damage Group
Diagnosis                                          Number
Chronic brain syndrome, associated with trauma
Cerebral vascular accident
Convulsive disorder
Cerebral arteriosclerosis
Cortico-striato-spinal disease
Alzheimer’s disease
General paresis
Brain tumor
Chronic brain syndrome, alcohol

NP Control Group
Diagnosis                                          Number
Depressive reaction                                17
Schizophrenia                                      10
Anxiety reaction
Psychosomatic
Character disorder
Paranoid state
Conversion reaction
Phobic reaction

Total
Average age
Years of formal schooling


Apparatus. The spiral used here was a 920-degree Archimedes spiral made from a tracing of the one used by Price and Deabler (1955), altered to a diameter of 7 inches. It was rotated by a device with a variable speed control which permitted rotation at speeds of from 18 to 90 rpm and a pulley arrangement which allowed rotation in either direction. This apparatus was located slightly below eye level for the average S at a viewing distance of approximately 6 feet.

Experimental design. The rotation speeds used were selected on the basis of a preliminary study with eight normal Ss as speeds which represented an approximately optimal condition (90 rpm), a definitely nonoptimal condition (18 rpm), and the midpoint between these extremes (54 rpm). Two exposure times were used. Ten seconds appeared to be a near-minimum time for SAE perception, while the 30-second exposure was one which earlier investigators had employed successfully in obtaining aftereffect reports from nearly all of their normal Ss.

Ten trials were presented to each S in a fixed order of decreasing difficulty as determined by the preliminary study. The first eight trials represented all combinations of 18 and 54 rpm, 10- and 30-second exposure times, and clockwise (CW) and counterclockwise (CCW) directions, providing a 2 × 2 × 2 factorial design applied to each S. The last two trials were run under conditions similar to those employed in most previous clinical studies, thus providing a basis for comparison with the results of earlier investigators.
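The trial structure described above can be made concrete with a short sketch. It enumerates the eight factorial trials and the two 90 rpm comparison trials, labeling each in the speed-direction-exposure form used in Table 2. The function and variable names are illustrative assumptions, and the enumeration order shown is not the fixed order of decreasing difficulty used in the study, which came from the preliminary study.

```python
from itertools import product

# Factor levels stated in the text for the eight factorial trials.
SPEEDS_RPM = (18, 54)
EXPOSURES_SEC = (10, 30)
DIRECTIONS = ("CW", "CCW")


def factorial_trials():
    """Return all 2 x 2 x 2 combinations as (speed, direction, exposure)."""
    return [
        (speed, direction, exposure)
        for exposure, speed, direction in product(
            EXPOSURES_SEC, SPEEDS_RPM, DIRECTIONS
        )
    ]


def comparison_trials():
    """Return the two trials at 90 rpm and 30 seconds, one per direction."""
    return [(90, direction, 30) for direction in DIRECTIONS]


def label(trial):
    """Format a trial as speed-direction-exposure, e.g. '18-CW-10'."""
    speed, direction, exposure = trial
    return f"{speed}-{direction}-{exposure}"


# Hypothetical helper list; the order is arbitrary, not the study's fixed order.
trials = factorial_trials() + comparison_trials()

if __name__ == "__main__":
    for trial in trials:
        print(label(trial))
```

Each S received all ten trials, so the three stimulus factors are fully crossed within subjects for the first eight.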

Procedure. Ss were asked if they could see the spiral clearly, and were instructed to look directly at it during all trials. They were told that the spiral would be rotated and that they were to watch for apparent expansion, contraction, or changes in depth or distance. The exact wording varied somewhat depending upon Ss’ educational level. Ss were asked to describe what the spiral appeared to be doing while it was rotating, and again when it was halted. Just before stopping the rotation on each trial, the experimenter reminded S that he was to keep looking at the spiral even when the apparatus was turned off. All responses were recorded as nearly verbatim as possible, and in doubtful cases a brief inquiry was conducted at the end of the regular trials.


TABLE 2 


PERCENTAGE OF SUBJECTS REPORTING AFTEREFFECT
UNDER EACH SET OF CONDITIONS


Conditions    Cortical Damage    NP Control    Student
18-CCW-10 W/ 40, 
18-CW-10 2 74 
54-CW-10 90 
54-CCW-10 2 Oo 
18-CCW-30 76 
18-CW-30 22 94 
54-CW-30 2 2 100 
54-CCW-30 86 
90-CCW-30 9S 
90-CW-30 2 100 


Fig. 1. Percentage of subjects reporting apparent size and/or depth changes (ordinate: percent of subjects; abscissa: conditions).


RESULTS 


Of primary interest in this study were the 
effects of stimulus variation upon report of 
the spiral aftereffect by the different groups 
of Ss. Table 2 shows the percentage of Ss in 
each group reporting the SAE under each 
combination of conditions. The sets of condi- 
tions are listed in the fixed order in which 
they were presented, with rotation speed given 
first, followed by the direction of rotation, 
and then by the exposure time. Clockwise ro- 
tation normally produces an aftereffect of ex- 
pansion, while the CCW rotation usually pro- 
duces a contraction aftereffect. 

Although the SAE was of particular con- 
cern, the responses given both during rotation 
of the spiral and immediately after its cessa- 
tion were analyzed. Figure 1 presents the per- 
centage of Ss in each group who reported ap- 
parent size and/or depth changes during ro- 
tation and during the usual aftereffect period 
for each set of conditions. The sets of condi- 
tions are listed in the order in which they 
were presented. Obviously there is little dif- 
ference among these groups with regard to 







TABLE 3

SUMMARY OF ANALYSES OF VARIANCE OF PROPORTIONS OF SUBJECTS REPORTING AFTEREFFECT


Mean Square, Nonorganic

Exposure Times (A)        0.4857
Rotation Speeds (B)       0.1567
Rotation Direction (C)    0.4349

0.0210
0.0042
0.0004
0.0101

0.0100

Total


perception of apparent size and depth changes 
during rotation of the spiral. A medians chi 
square test done on these data supports this 
conclusion. There is, however, a clear differ- 
ence with regard to the SAE. While the Stu- 
dent and NP Control Groups respond very 
similarly, the Cortical Damage Group reports 
the SAE much less frequently. The condi- 
tions which apparently produce the greatest 
differentiation of organic and nonorganic Ss 
are those of medium difficulty. The condi- 
tions most nearly approximating those of 
previous studies discriminate much less well. 

Because the response of the Cortical Dam- 
age Group was so clearly different from that 
of the other two groups, separate analyses of 
variance of proportions were done, using the 


Fig. 2. Number of subjects receiving each SAE score (ordinate: number of subjects; curves for the NP Control, Cortical Damage, and Student groups).


Organi Organ 
0.2560 ] 12.80** 
0.1580 7** 7.90* 
0.0623 3. 49* 312 


0.0070 0.35 
0.0099 0.49 
0.0001 0.01 
0.0199 1.00 


0.0200 


method of Walker and Lev (1953). These analyses are summarized in Table 3. All main effects were significant for the combined Student and NP Control Groups, while for the Cortical Damage Group exposure time and rotation speed had significant effects, but direction of rotation did not. None of the interactions between variables was statistically significant.
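As a check on how the F values in Table 3 relate to the mean squares, the following sketch divides each main-effect mean square for the Cortical Damage Group by the final mean square listed for that group. It assumes that the 0.0200 entry is the error term, which the legible part of the table does not label, and the dictionary keys are illustrative names only.

```python
# Mean squares for the Cortical Damage (organic) group as legible in Table 3.
ORGANIC_MEAN_SQUARES = {
    "exposure_time": 0.2560,
    "rotation_speed": 0.1580,
    "rotation_direction": 0.0623,
}

# Assumption: the last mean square listed for this group is the error term.
ORGANIC_ERROR_MEAN_SQUARE = 0.0200


def f_ratio(effect_mean_square, error_mean_square):
    """Return the ratio of an effect mean square to the error mean square."""
    return effect_mean_square / error_mean_square


organic_f_ratios = {
    name: f_ratio(ms, ORGANIC_ERROR_MEAN_SQUARE)
    for name, ms in ORGANIC_MEAN_SQUARES.items()
}

if __name__ == "__main__":
    for name, value in organic_f_ratios.items():
        print(f"{name}: F = {value:.2f}")
```

Under that assumption the ratios agree with the tabled values of 12.80, 7.90, and 3.12 to within rounding.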


Since the clinical efficacy of this phenomenon as a tool for use in the diagnosis of brain damage has been a major point of contention in the literature, each response of every S was assigned a score of 1 if it indicated the normal SAE, 0 if it did not, following the procedure of Gallese (1956). Thus each S received a score which was equal to the number of normal SAE responses. Figure 2 shows the number of Ss receiving each score. As this figure shows, the Cortical Damage Group differs markedly from the other groups in terms of the number of SAE responses reported. Of the 50 Cortical Damage Ss, 44 reported the SAE fewer than 6 times in 10 trials, while 46 of the 50 NP Controls reported the SAE on 6 or more of their 10 trials. A medians chi square test on these data indicates that this difference between groups is significant well beyond the .001 level.
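The size of this difference can be illustrated by computing a chi square for the 2 × 2 table formed from the counts just given. This is only a sketch: it assumes the median test was applied to the two patient groups split between 5 and 6 reports, which the text does not state in detail, and it uses the standard shortcut formula for a fourfold table. The function name and the optional Yates correction are additions for illustration.

```python
def chi_square_2x2(a, b, c, d, yates=False):
    """Chi square (df = 1) for the fourfold table [[a, b], [c, d]]."""
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        # Continuity correction; never let the corrected term go negative.
        diff = max(diff - n / 2, 0.0)
    return n * diff ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Counts from the text: Ss reporting the SAE fewer than 6 times vs. 6 or more.
CORTICAL_BELOW, CORTICAL_ABOVE = 44, 6
NP_BELOW, NP_ABOVE = 4, 46

# Tabled critical value of chi square for df = 1 at the .001 level.
CRITICAL_001 = 10.828

chi_sq = chi_square_2x2(CORTICAL_BELOW, CORTICAL_ABOVE, NP_BELOW, NP_ABOVE)
chi_sq_yates = chi_square_2x2(
    CORTICAL_BELOW, CORTICAL_ABOVE, NP_BELOW, NP_ABOVE, yates=True
)

if __name__ == "__main__":
    print(f"chi square = {chi_sq:.2f}, with Yates correction = {chi_sq_yates:.2f}")
```

Either value is several times the critical value for the .001 level, which is consistent with the statement in the text.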
Information concerning the location of each patient’s brain damage, insofar as it could be ascertained, is presented in Table 4, along





Stimulus Variation and Spiral Aftereffect 


TABLE 4

AFTEREFFECT “SCORES” FOR DIFFERENT AREAS OF BRAIN DAMAGE


Location of Damage              Mean

Generalized, bilateral          1.26
Generalized, right side         2.00
Generalized, left side          1.67
Bilateral Frontal               1.00
Bilateral Frontal-Temporal      0.00
Left Frontal                   10.00
Left Frontal-Temporal           5.20
Left Temporal                   1.00
Left Occipital                  2.00
Right Frontal-Temporal          0.00
Right Temporal                  9.00
Right Parietal                  4.67

Bilateral Damage
Unilateral Damage


with data on the SAE scores for patients with damage in each area. Only 2 of the 50 patients in this group came to autopsy during the period of this study, and even in these cases complete histological examination of the brain was not carried out. Therefore, it must be recognized that the location of damage is not always clearly delineated or precisely placed. These data, based on the best information available, can only be offered as being suggestive, not conclusive. Unfortunately, the number of patients with damage apparently limited to any one discrete area is too small to permit drawing definite conclusions about the relationship between the location of damage and report of the SAE. There do, however, appear to be differences associated with the extent of damage, bilateral or generalized damage being associated with lower scores than unilateral or localized damage. A t test, with correction for heterogeneity of variance, indicated that the unilateral vs. bilateral difference was statistically significant (p < .01).
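A t test that allows for unequal variances can be sketched as follows. The text does not say which correction was used, so the Welch form with Satterthwaite degrees of freedom is substituted here as one common choice. The two score lists are hypothetical values invented for illustration and are not the data of this study.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(sample_a, sample_b):
    """Return (t, df) for a two-sample t test without assuming equal variances."""
    n_a, n_b = len(sample_a), len(sample_b)
    # Squared standard error contributed by each sample.
    se_a = variance(sample_a) / n_a
    se_b = variance(sample_b) / n_b
    t = (mean(sample_a) - mean(sample_b)) / sqrt(se_a + se_b)
    # Welch-Satterthwaite approximation to the degrees of freedom.
    df = (se_a + se_b) ** 2 / (se_a ** 2 / (n_a - 1) + se_b ** 2 / (n_b - 1))
    return t, df


# Hypothetical SAE scores (0 to 10), for illustration only.
bilateral_scores = [0, 1, 1, 2, 1]
unilateral_scores = [2, 9, 5, 10, 4]

t_value, df_value = welch_t(bilateral_scores, unilateral_scores)

if __name__ == "__main__":
    print(f"t = {t_value:.3f}, df = {df_value:.2f}")
```

When one group is much more variable than the other, as in the invented lists, the approximate degrees of freedom fall well below the pooled value, which makes the test more conservative.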


DISCUSSION 


As might have been anticipated from the 
work done on other perceptual phenomena, 
the results of this study show that both short- 
ening the time of exposure to the rotating 
spiral and decreasing its rotation speed make 
perception of the SAE more difficult. This is 
true for both brain damaged and nonorganic 
Ss. From the analyses of variance presented 
in Table 3, it appears that the variable which 
contributes most to the probability of occur- 
rence of the SAE is the length of time for 


which the rotating spiral is observed. The 
amount of variance attributable to rotation 
speed is also statistically significant for all 
groups, but the actual sums of squares are 
much smaller than those for exposure time. It 
is interesting that direction of rotation should 
be a significant variable for the nonorganic 
Ss but not for the organic group. Apparently 
this is due to the fact that perception of the 
SAE was so difficult for the latter group un- 
der all but the most optimal conditions that 
the increment of difficulty added by CCW rotation made very little overall difference.

The results of this study are clearly in 
agreement with Price and Deabler’s (1955) 
contention that report or nonreport of the 
SAE discriminates between brain damaged 
and nonorganic Ss with a fairly high degree 
of accuracy. Using the series of stimulus con- 
ditions developed for this experiment and a 
cutoff score at the median point for all pa- 
tients, i.e., between five and six reports of the 
SAE in 10 trials, 88% of the Cortical Dam- 
age and 92% of the NP Control patients are 
correctly identified. However, it must be 
pointed out that these Ss are not a random 
sample from a general hospital population, 
but rather specific patients for whom there 
was evidence of organicity or a lack of such 
evidence. 
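The percentages quoted above follow directly from the cutoff rule and the counts reported in the Results section. The sketch below restates that arithmetic. The function names are illustrative, and the per-group counts are the ones given in the text (44 of 50 Cortical Damage Ss below the cutoff, 46 of 50 NP Controls at or above it).

```python
# Cutoff between five and six reports of the SAE in 10 trials.
CUTOFF_SCORE = 6


def classify(sae_score):
    """Label a patient from the number of SAE reports in 10 trials."""
    return "nonorganic" if sae_score >= CUTOFF_SCORE else "organic"


def percent_correct(n_correct, n_total):
    """Return the percentage of correctly identified patients."""
    return 100 * n_correct / n_total


cortical_correct = percent_correct(44, 50)
np_control_correct = percent_correct(46, 50)
overall_correct = percent_correct(44 + 46, 100)

if __name__ == "__main__":
    print(cortical_correct, np_control_correct, overall_correct)
```

Taken over both patient groups together, the same counts give 90 of 100 patients correctly identified.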


The patients who are misclassified by this method of scoring can be clearly separated into two categories. Two of the Ss in each group received borderline scores, i.e., reported the SAE sufficiently often to be just above or just below the cutoff point. Four Cortical Damage Ss and two NP Controls are markedly atypical of their respective groups in report of the SAE.

The two NP Control Ss who failed to report the SAE on any trial were both over 60 years of age. While neither showed any obvious clinical signs of deterioration such as are usually associated with brain damage, one could argue that in the absence of histological controls, organic brain pathology in Ss of this age could very well have been undetected in the general clinical picture, while still affecting SAE report. It should be pointed out, however, that many elderly patients received maximal or near-maximal SAE scores in this study.


The four Cortical Damage Ss who reported 
the SAE 9 or 10 times in 10 trials show a 
definite similarity in their respective clinical 
pictures, although the diagnosed area and type 
of damage is different in each case. Each of 
these patients has had very mild damage, al- 
ways very localized (as far as is known), and 
each shows little or no neurological or intel- 
lectual impairment. This would again support 
the conclusion that size and severity of dam- 
age are highly important factors in determin- 
ing whether or not the SAE is reported. Ex- 
tensive and/or severe damage to any part of 
the cortex appears much more likely to de- 
stroy the SAE than is localized damage. This 
may very well be one of the principal reasons 
for the great variation in the results reported 
by different investigators who have studied 
the spiral as a clinical diagnostic device. It 
seems a likely explanation for the “normal” 
responses of many of the lobotomy cases and 
epileptic Ss studied by various researchers. 

There is still the possibility, however, that 
difficulty in verbal report, which might be ex- 
pected to correlate with the severity of or- 
ganic damage, is the crucial factor. Both 
Gallese (1956) and Aaronson (1958) have 
raised this possibility. Furthermore, Schein 
(1960) and Van de Castle and Strong (1957) 
noted that with many patients who did not 
report the SAE spontaneously, a slight altera- 
tion of the test stimulus would elicit such a 
report. The latter findings could mean that 
these patients were having difficulty in verbal- 
izing the SAE, much as Aaronson’s patients 
with anomia apparently did. It could also sug- 
gest, however, that this was a reflection of 
psychological rigidity, of the inability to shift 
one’s psychological set, which has so fre- 
quently been described as characteristic of 
the brain injured. Because of the explicit in- 
structions used in the present study and the 
inquiry conducted in doubtful cases, it is un- 
likely that the differences in SAE report seen 
here can be attributed primarily to verbal in- 
efficiency. Furthermore, in the analysis of re- 
sponses given during rotation of the spiral it 
was found that all except three of the Cortical 
Damage Ss reported the occurrence of appar- 
ent size and/or depth changes during rota- 
tion, indicating that it was not an inability 
to perceive, conceptualize, or verbalize such 




phenomena which led to nonreport of the 
SAE. 

Nevertheless, the results of the above stud- 
ies raise a very important question. Can it be 
said that many brain damaged patients do 
not perceive the SAE, i.e., do not consciously 
experience it unless a new test figure is sub- 
stituted for the original? If so, this implies 
that the apparent loss of the SAE in brain 
damaged patients is due to something more 
than just a destruction of sensory elements in 
the cortical visual system. It would appear 
that for spontaneous perception of the SAE 
not only must the neuronal chains from retina 
to visual cortex be reasonably intact, but also 
that other parts of the brain which are involved in organizing and transforming sensory information into conscious awareness must be functional. If this is correct, it should not be surprising that mild, focal damage almost anywhere in the cortex does not interfere with the SAE, while larger, more severe damage usually does.

It appears to me that much of our current 
theorizing concerning the apparent destruc- 
tion of the SAE by brain damage has been 
much too narrow in scope. We have been at- 
tempting to seek out a single comprehensive 
factor which would explain both the failure 
of many brain damaged Ss to report the SAE 
and the contradictory evidence which has 
been reported in several studies. The factor 
of impaired verbal report is perhaps the most 
frequent “cause” invoked to explain this dis- 
crepancy. I believe that in doing so, we have 
been very much inclined to oversimplify what 
is in actuality an extremely complex process. 
Because of the striking character of the phe- 
nomenon, it is easy to forget that the percep- 
tion of the SAE is in itself a highly complex 
operation. Multitudinous factors are involved 
in determining whether or not a particular S 
will perceive the SAE on any given trial. In 
the present study many Ss, even among the college students, sometimes failed to perceive the SAE under what are usually favorable conditions after having reported the phenomenon under more difficult conditions. This study has provided a systematic investigation of several stimulus variables, with certain subject variables controlled, but there are many variables which were not, and perhaps


could not be controlled in such a clinical ex- 
periment. We know from the work of Wohlge- 
muth (1911) and others that such variables 
as the size and viewing distance of the spiral, 
the state of light or dark adaptation of the 
eye, the intertrial interval, the constancy of 
fixation, and the fatigue state of the S can all 
be important. Brain damage, then, is not the 
only factor which may prevent the occurrence of the SAE, and likewise, brain damage
alone may not prevent such occurrence under 
otherwise favorable conditions. 

Not only is the SAE a complex phenome- 
non, but the brain itself is such an extremely 
complex structure with at least some degree 
of localization of function, that it seems to 
me completely unreasonable to expect that 
brain damage as a general clinical entity 
should affect SAE perception in the same 
way in each patient. Damage of different de- 
grees of severity to different areas of the brain 
will produce different effects, any one or more 
of which may be crucial with regard to SAE 
report in a given case. Frontal lobe damage, 
especially with involvement of the prefrontal 
region, frequently interferes with the ability 
of Ss to attend or concentrate and greatly 
increases their distractibility (Peale, 1954). 
Thus, in many cases the SAE may not be 
perceived primarily because of the inability 
to maintain fixation. Goldstein (1942) tells 
us also that damage to the frontal lobes im- 
pairs the patient’s ability to take and hold 
the abstract attitude. His methods for the 
measurement of such impairment would sug- 
gest that this is closely related to an inability 
to shift psychological set, and it may be that 
such patients cannot consciously experience 
the SAE unless something is done to break 
their original set, as Schein (1960) did. In 
still other cases, damage to the temporal lobe 
producing an aphasic disturbance could re- 
sult in an inability to report the SAE, even 
if it is consciously perceived. Both Teuber 
and Bender (1949) and Werner and Thuma 
(1942) found that unusually short interstimulus intervals were necessary to produce apparent movement phenomena in their brain damaged Ss, suggesting that some sort of psychophysiological deficit was occurring in the integration of discrete visual stimuli. In many cases with diffuse and severe cortical damage the cortical visual system and/or “association” cortex may be so disrupted as to prevent reception and integration of the stimuli impinging upon the retina.

In the present study, the use of some stimulus conditions which make SAE perception difficult even for normal Ss added a further increment of difficulty to those noted above. Thus, in addition to the different physiological effects produced by shorter, more slowly moving stimuli, the unusually long series of trials increased the tendency for physical and psychological fatigue to interfere with S’s concentration on the task at hand. These additional handicaps for the brain damaged patient, whose SAE report may already have been impaired by one or more of the factors suggested above, probably produced the relatively clear-cut differentiation of brain damaged and nonorganic groups in the present study.


SUMMARY 


This study was designed to investigate the 
effects of certain stimulus variables upon per- 
ception of the spiral aftereffect (SAE). A 
920° Archimedes spiral was presented to 50 
patients with cortical brain damage, 50 psy- 
chiatric patients with no known or suspected 
brain damage, and 50 college students in a 
standard series of trials. These trials contained 
all combinations of 18 and 54 rpm rotation 
speeds, 10- and 30-second exposure times, and 
clockwise and counterclockwise rotations. One 
additional trial in each direction was given at 
90 rpm with a 30-second exposure time, ap- 
proximating the conditions used by other in- 
vestigators. 

It was found that during rotation of the 
spiral the patient groups reported apparent 
size and/or depth changes with approximately 
equal frequency. Following rotation, however, 
46 of the 50 NP Control Ss reported the SAE 
6 or more times in 10 trials, while only 6 
of the Cortical Damage Ss did likewise. The 
college students responded almost identically 
with the NP Controls. The atypical psychi- 
atric patients tended to be elderly people, 
raising the possibility of undiagnosed brain 
damage associated with aging, while the atypi- 
cal Cortical Damage Ss were characterized by 
mild and localized damage, with little or no 







clinical neurological or intellectual impair- 
ment. Although histological evidence was not 
available, the data strongly suggest that location of damage is relatively unimportant,
but that extensive or severe damage to any 
part of the cortex markedly reduces the prob- 
ability of SAE report. 

Analyses of variance indicated that ex- 
posure time and rotation speed were signifi- 
cant factors in perception of the SAE for all 
subject groups, while direction of rotation was 
a significant variable only for the nonorganic 
groups. Conditions of medium difficulty dis- 
criminated best between the patient groups, 
while conditions similar to those of previous 
studies were less discriminating. 

It was concluded that multiple factors are 
involved in the disruption of SAE report in 
brain damaged Ss, and that the increment of 
difficulty added by a long series of relatively 
difficult trials contributed to the clear-cut dis- 
crimination between groups in this study. 

REFERENCES 
Aaronson, B. S. Age, intelligence, aphasia and the spiral aftereffect in an epileptic population. J. clin. Psychol., 1958, 14, 18-21.

Davids, A., Goldenberg, L., & Laufer, M. W. The relation of the Archimedes spiral aftereffect and the Trail Making Test to brain damage in children. J. consult. Psychol., 1957, 21, 429-433.

Gallese, A. J. Spiral aftereffect as a test for organic brain damage. J. clin. Psychol., 1956, 12, 254-258.

Garrett, E. S., Price, A. C., & Deabler, H. L. Diagnostic testing for cortical brain impairment. AMA Arch. Neurol. Psychiat., 1957, 77, 223-225.

Gilberstadt, H., Schein, J. D., & Rosen, A. Further evaluation of the Archimedes spiral aftereffect. J. consult. Psychol., 1958, 22, 243-248.

Goldstein, K. Aftereffects of brain injuries in war. New York: Grune & Stratton, 1942.

Hammer, E. R. Temporal factors in figural aftereffects. Amer. J. Psychol., 1949, 62, 337-354.

Page, H. A., Rakita, G., Kaplan, H. K., & Smith, N. B. Another application of the spiral aftereffect in the determination of brain damage. J. consult. Psychol., 1957, 21, 89-91.

Peale, T. L. The neuroanatomical basis for clinical neurology. New York: McGraw-Hill, 1954.

Philbrick, E. B. The validity of the spiral aftereffect as a clinical tool for diagnosis of organic brain pathology. J. consult. Psychol., 1959, 23, 39-43.

Price, A. C., & Deabler, H. L. Diagnosis of organicity by means of spiral aftereffect. J. consult. Psychol., 1955, 19, 299-302.

Schein, J. D. Personal communication, 1960.

Spivak, G., & Levine, M. The spiral aftereffect and reversible figures as measures of brain damage and memory. J. Pers., 1957, 25, 767

Teuber, H. L., & Bender, M. B. Alterations in pattern vision following trauma in the occipital lobe in man. J. gen. Psychol., 1949, 40, 37-57.

Van de Castle, R. L., & Strong, P. N. A further study on the diagnostic efficiency of the spiral aftereffect. Paper read at Southeastern Psychological Association, Nashville, Tennessee, March 1957.

Walker, H. M., & Lev, J. Statistical inference. New York: Holt, 1953.

Werner, H., & Thuma, B. D. A deficiency in the perception of apparent motion in children with brain injury. Amer. J. Psychol., 1942, 55, 58-67.

Wohlgemuth, A. On the after-effect of seen movement. Brit. J. Psychol., 1911, Monogr. Suppl. 1, 1-117.


(Received Februar 





INTELLECTUAL FUNCTIONING IN A GROUP OF 
NORMAL OCTOGENARIANS' 


ANDREW S. DIBNER AND JAMES F. CUMMINS
n Outpatient Clinic, Boston, Massachusett 


This paper presents some empirical findings on the intellectual functioning of 50 men who were in service during the Spanish American War and who now average 80 years of age. These men live in the Greater Boston Area and responded to an invitation to attend the research oriented Geriatric Clinic established at the Boston Veterans Administration Outpatient Clinic in 1958. The functions of the clinic as well as social data on these men are described elsewhere by Nichols and Cummins (in press).

The questions asked by this study were the following: First of all, how would these 80-year-old, relatively healthy men perform on the Wechsler-Bellevue Intelligence Scale (Wechsler, 1944) and how would their results compare with those reported in other studies of older people? Secondly, is there evidence of intellectual decline as measured by this test, and if so, how does this evidence compare with findings on other age groups; and, is this intellectual decline related to level of intelligence as has been sometimes claimed? Thirdly, do the time limits established for the Wechsler-Bellevue scale, Form I, inordinately penalize the slower, older person?

The Wechsler-Bellevue Form I scale was employed rather than the newer Wechsler Adult Intelligence Scale (Wechsler, 1953) because there are more data available on Form I performance by various pathological groups and normal age groups with which these data might be compared.

1 Based upon a paper presented at the twelfth Annual Meeting of the Gerontological Society, Inc., Detroit, Michigan, November 1959, resulting from an investigation carried out at the Boston Veterans Administration Outpatient Clinic.


METHOD

Subjects. Table 1 presents some descriptive information about the subjects. The Total Group of 50 men range in age from 73 to 89 years, with mean age at 80.4, standard deviation 2.1 years. Their education ranges from no formal education to college graduate (16 years). Mean education for the Total Group is 7.9 years. Their highest achieved occupational status as rated according to Warner's scale (Warner, Meeker, & Eells, 1949) ranges from 1 (the highest level, as professional) to 7 (the lowest level, as laborer) with a mean occupational level of 4.2.

For the purpose of intragroup analysis the Total Group was divided into Younger and Older Groups. The 23 subjects in the Younger Group range in age from 73 to 79 years and their mean age is 77.6. The 27 subjects in the Older Group range in age from 80 to 89 years and their mean age is 82.8 years, approximately 5 years older than the Younger Group. As can be seen from Table 1, the two groups are highly similar in education and highest achieved occupational level.

Procedure. Subjects were administered the 11 subtests of the Wechsler-Bellevue Intelligence Scale, Form I, by experienced examiners using standard instructions with the exception of extending time limits on all timed tests except Digit Symbol. The method is similar to that employed by Doppelt and Wallace (1955) and was done in order to compare performance under standard (ST) and extended time (ET) limits.

Although the Vocabulary subtest was administered, it was not considered in determining the Verbal weighted scores or IQs in order to make these results more comparable with those of other investigators. The IQs were determined by the extrapolation technique suggested by Wechsler (1944) for age groups beyond those given in his tables.


RESULTS

Table 2 presents the Verbal, Performance, and Total weighted scores and IQs of the Younger, Older, and Total Groups. With respect to weighted scores, differences between groups are slight and nonsignificant; though curiously, the Older subjects outdo the Younger on the performance scale, where they





might be expected to do worse. This result is probably a sampling fluctuation. The superiority of verbal weighted score over performance weighted score is statistically significant for both Older and Younger subjects.

With respect to IQ, there are no significant differences between groups. The mean IQ, using Wechsler's extrapolation method, is 109.

Table 3 presents the means and standard deviations of subtest scores for the subgroups and Total Group. The highest subtest scores for the Total Group are Vocabulary, Information, and Comprehension, with average weighted scores of 10 or 11. The next lower group of scores, averaging about 8 or 9, were those of Arithmetic, Similarities, Picture Completion, and Object Assembly. The lowest grouping of scores were those on Digit Span, Block Design, Picture Arrangement, and Digit Symbol, averaging 5 or 6.

It is worthy of note that the Information, Comprehension, and Vocabulary scores above 10 are consistent with the above average IQs yielded for the test as a whole by Wechsler's extrapolation method.


TABLE 1

AGE, EDUCATION, AND OCCUPATION OF SUBJECTS

Group      Age in Years (Range)   Education in Years (Range)
Younger    73-79                  4-14
Older      80-89                  0-16
Total      73-89                  0-16


TABLE 2

VERBAL, PERFORMANCE, AND FULL SCALE WEIGHTED SCORES AND IQs FOR YOUNGER, OLDER, AND TOTAL GROUPS

Verbal WS 11.8 5 10.1
Performance WS 9.6 2 9.5
Full Scale WS 18.1 8 16.3
Verbal IQ 2 108 9.2
Performance IQ 7 9.9 2 10.1
Full Scale IQ 108.7 2 10.2 9.3


One may examine the pattern of differential 
decline of the various subtests by two means: 
first by comparing the differential scatter of 
subtests which occurred within the two age 
groups, and then by comparing these subjects’ 
scatter of subtest scores with that reported 
by other investigators with aged subjects. 

Figure 1 demonstrates the similarity of sub- 
test scatter for the Younger and Older Groups 
within the sample. First, we note that the 
Younger men score higher than the Older 
men on 8 of the 11 subtests, although none
of these differences is significant by t test.
Secondly, we note that the pattern of per- 
formance of the two groups on the various 
subtests is highly similar. This is evidence 
within this particular sample of the reliability 
of the differential decline of subtest perform- 
ance which has been reported for age by 
Wechsler (1953) and others. 


TABLE 3

MEANS AND STANDARD DEVIATIONS OF SUBTEST WEIGHTED SCORES FOR YOUNGER, OLDER, AND TOTAL GROUPS

Subtest
Information
Comprehension
Digit Span
Arithmetic
Similarities
Vocabulary
Pic. Arrangement
Pic. Completion
Block Design
Obj. Assembly
Digit Symbol





Intellectual Functioning in Octogenarians 


FIG. 1. Mean weighted subtest scores for younger and older groups. (Ordinate: Weighted Scores; abscissa: Subtests; plotted series: Younger Group and Older Group.)


Table 4 presents the subtest performance 
of this aged group contrasted with that of 
others reported in the literature, with subtests 
ranked in terms of their average weighted 
scores. Although these various samples were 
drawn partly from institutions and partly 
from the community, there seems to be good 
agreement at the ends of the scale. As indicated
by the averages of the ranks across
studies, the abilities contributing to performance
in the Information, Comprehension, and
Arithmetic subtests are best retained in the
older years, and those involved in perform- 
ance on the Block Design, Picture Arrange- 
ment, and Digit Symbol subtests are least 
well retained. The major factor involved in 
the last mentioned three is probably speed. 

The four subtests ranked in the middle
range do not clearly distinguish themselves
from each other in terms of differential decline.


Wechsler developed a method of estimating 
deterioration of functioning by comparing the 
sum of scores of the subtests which have been 
found to decline more slowly with age with 
the sum of scores of those subtests on which 
performance tends to be significantly im- 
paired with age. This so-called deterioration 
was measured in the present group and com- 
pared with Wechsler’s norms for younger age 
groups. The result is shown in Figure 2. 

Using Wechsler’s formula for deterioration 
quotient (or DQ) shown in the lower left- 
hand corner of Figure 2, the average DQ for 
the Younger Group was found to be 70.1 and 
for the Older Group to be 66.5. These are 
plotted along with the DQs for various age 
groups reported by Wechsler. They seem to 
fall directly on an extrapolation of the em- 
pirical curve determined by Wechsler. Thus, 
the same rate of decline of abilities relative to 
each other is occurring in this sample as in 


TABLE 4

SUBTESTS RANKED BY AVERAGE WEIGHTED SCORE IN PRESENT STUDY AND OTHERS

Studies compared: Present study (N = 50, average age 80); Fox & Birren, 1950; Madonick & Solomon, 1947; Chesrow, 1949; Howell, 1955; Howell, 1955. Final column: Average of Ranks.

Subtests:
Information
Comprehension
Arithmetic
Pic. Completion
Similarities
Object Assembly
Digit Span
Block Design
Pic. Arrangement
Digit Symbol





FIG. 2. Average deterioration quotients at different ages: Wechsler's norms and present results. (Ordinate: Deterioration Quotient; abscissa: Age Groups; plotted series: Wechsler Data and Present Data. Inset formula, lower left: Σ(Dig. Sp. + Arith. + Bl. Des. + D. Symb.) / Σ(Info. + ...).)


Wechsler’s sample despite the fact that the 
present sample is of higher average intelli- 
gence. The DQ, as an intratest ratio for each 
individual, is relatively independent of the 
level of subtest scores. 
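
The ratio described above can be illustrated with a minimal Python sketch. It assumes the quotient is 100 times the sum of weighted scores on the subtests that decline with age, divided by the sum on the subtests that are retained. The grouping of subtests (the `DECLINING` and `RETAINED` tuples) is an assumption based on the legible part of the Figure 2 formula and on Wechsler's usual practice, and the scores are hypothetical, not data from this study.

```python
# Subtest groupings are assumptions made for this sketch, not taken from the text.
DECLINING = ("digit_span", "arithmetic", "block_design", "digit_symbol")
RETAINED = ("information", "comprehension", "picture_completion", "object_assembly")


def deterioration_quotient(scores):
    """Return 100 * (sum of declining subtests) / (sum of retained subtests)."""
    declining_sum = sum(scores[name] for name in DECLINING)
    retained_sum = sum(scores[name] for name in RETAINED)
    if retained_sum <= 0:
        raise ValueError("sum of retained subtest scores must be positive")
    return 100.0 * declining_sum / retained_sum


# Hypothetical weighted scores for one subject (illustration only).
subject = {
    "information": 11, "comprehension": 10,
    "picture_completion": 9, "object_assembly": 8,
    "digit_span": 6, "arithmetic": 9,
    "block_design": 6, "digit_symbol": 5,
}
dq = deterioration_quotient(subject)

# Scaling every score by the same factor leaves the quotient unchanged,
# which is the sense in which an intratest ratio ignores overall level.
doubled = {name: 2 * value for name, value in subject.items()}
dq_doubled = deterioration_quotient(doubled)
```

Because the quotient is a ratio within one person's record, a uniformly brighter subject and a uniformly duller one with the same profile shape would obtain the same value.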

The relation of deterioration to present 
level of intelligence was tested further by 
correlating DQ with IQ. The correlation was 
.03. DQ was then correlated with two other 
measures which can be considered to be re- 
lated to past functional intelligence, namely, 
education and highest occupational level 
reached. The correlations were —.01 and .01. 
Thus no evidence was found of relationship 
between present or past intellectual level and 
amount of present intellectual decline. 

Now let us consider the effects of using 
standard time limits or extended time limits 
with this older group. 

Table 5 presents the changes in weighted scores on four timed subtests which occurred by allowing subjects extra time. Twelve percent of the subjects increased their Arithmetic score an average of 1.5 points, resulting in a rise in the group average of only 0.2 of a point. The change in the Picture Arrangement subtest was also slight, as was that in the Object Assembly subtest. However, 43% of the subjects improved their Block Design score when given extra time. These changes did not appreciably disturb the rank-ordering of the subjects within the sample as shown by the high correlations between scores under standard and extended conditions.


TABLE 5

MEAN WEIGHTED SCORES AND CHANGES IN SCORES ON FOUR SUBTESTS USING STANDARD AND EXTENDED TIME LIMITS

Mean
SD
Percentage of Group Changed
Average Change
rho

Table 6 presents the changes in IQ which
resulted from using extended time limits. Only
12% of the subjects increased their Verbal
IQ, and the group average rose only slightly,
from 109.8 to 110.1. However, 60% of the
Performance IQs were raised, bringing the
average for the group up from 112 to almost
114, and 61% of the Full Scale IQs were
raised by allowing extra time, although this
raised the group average by only one point. The rank
correlations between IQ scores under standard 
and extended time conditions were extremely 
high, indicating that these older subjects do 
not appreciably change their ranking in rela- 
tion to each other as a result of having ex- 
tended time limits. 
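
The point that scores can rise while rankings stay put can be illustrated with a small rank-correlation sketch in Python. It computes rho as the product-moment correlation of ranks, with tied values given average ranks; the text does not say how ties were handled, so that choice is an assumption, and the IQ values below are hypothetical.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Rank correlation: Pearson correlation computed on average ranks."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length, at least 2 items")
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("rho is undefined when one variable is constant")
    return cov / (var_x * var_y) ** 0.5


# Hypothetical IQs for five subjects under standard and extended time limits.
standard_iq = [98, 104, 110, 115, 121]
extended_iq = [100, 104, 113, 116, 121]  # some scores rise, order is unchanged
rho_same_order = spearman_rho(standard_iq, extended_iq)
rho_reversed = spearman_rho(standard_iq, list(reversed(standard_iq)))
```

In the hypothetical data three of five subjects gain points, yet the ordering of subjects is identical, so rho is at its maximum; a high rho is therefore compatible with a large percentage of subjects changing score.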


TABLE 6

MEAN VERBAL, PERFORMANCE, AND FULL SCALE IQs AND CHANGES IN IQs USING STANDARD AND EXTENDED TIME LIMITS

                               Verbal IQ        Performance IQ    Full Scale IQ
                               ST      ET       ST      ET        ST      ET
Mean                           109.8   110.1    112.1   113.9
SD                             9.2              10.1
Percentage of Group Changed    12%              60%
Average Change                 1.5              3.2
rho                            .98              .95







SUMMARY 


A group of 50 relatively healthy, male veterans
of the Spanish American War, who now
average 80 years of age, was found to be 
above average in intelligence, performing well 
in tests measuring retention and comprehen- 
sion of verbal material but performing poorly 
in tests affected by psychomotor speed and 
abstract thinking. The pattern of decline of 
various abilities tested is consistent within the 
younger and older men of the sample and 
consistent with other studies of older groups. 
The deterioration quotient for the Total Group 
follows closely an extrapolation of the curve 
of deterioration with age empirically derived 
by Wechsler. Extent of intellectual decline was 
found to be unrelated to level of intelligence 
as measured by IQ, education, or highest oc- 
cupational level reached. 

The use of standard time limits was found to appreciably affect the older person's score on the Block Design subtest, but not the Arithmetic, Picture Arrangement, or Object Assembly subtests. Standard time limits were found to depress the Performance IQ two points and Full Scale IQ one point, but do not appreciably affect older persons' relative rankings within their group as far as IQ scores are concerned.


REFERENCES 


Chesrow, E. J., Wosika, P. H., & Reinitz, A. H. A psychometric evaluation of aged white males. Geriatrics, 1949, 4, 169-177.

Doppelt, J., & Wallace, W. Standardization of the WAIS for older persons. J. abnorm. soc. Psychol., 1955, 51, 312-330.

Fox, C., & Birren, J. E. The differential decline of subtest scores on the Wechsler-Bellevue Intelligence Scale in 60-69-year-old individuals. J. genet. Psychol., 1950, 77, 313-317.

Howell, R. J. Changes in Wechsler subtest scores with age. J. consult. Psychol., 1955, 19, 47-50.

Madonick, M. J., & Solomon, M. The Wechsler-Bellevue scale in individuals past sixty. Geriatrics, 1947, 2, 34-40.

Nichols, M. R., & Cummins, J. F. The lifelong social adjustment of a group of normal octogenarians. Geriatrics, in press.

Rabin, A. I. Psychometric trends in senility and psychoses of the senium. J. gen. Psychol., 1945, 32, 149-162.

Warner, W. L., Meeker, M., & Eells, K. Social class in America. Chicago: Science Research Associates, 1949.

Wechsler, D. Measurement of adult intelligence. (3rd ed.) Baltimore: Williams & Wilkins, 1944.

Wechsler, D. The measurement and appraisal of adult intelligence. (4th ed.) Baltimore: Williams & Wilkins, 1953.


(Received March 10, 196 





Journal of Consulting Psychology
1961, Vol. 25, No. 2, 142-145


PREDICTION OF RELAPSE FOR PSYCHIATRIC PATIENTS


DAVID J. GOUWS ¹

Western Psychiatric Institute and Clinic


In a review of the literature on prognostic measures in psychopathology Windle (1952) has commented favorably on Feldman's (1951) Ps scale. This scale was developed to predict the probability of success of shock treatment and consists of 52 items derived from the Minnesota Multiphasic Personality Inventory. Windle (1952) suggested that “further work in prognosis should employ this scale, or just as desirable, previously gathered data should be reanalyzed with it” (p. 466). Only two reports on cross-validation studies of the Ps scale were located (Pumroy & Kogan, 1958; Roberts, 1959). Both failed to confirm Feldman's own cross-validation findings. This paper, which reports a reanalysis of previously gathered data, presents further validational evidence for, and suggests possible interpretations of, the Ps scale.

1 On leave from the University of Pretoria, South Africa. It is a pleasure to acknowledge the help of Edith Fleming in obtaining the follow-up data on the patients used in this study.


SAMPLE DESCRIPTION

A search of the files yielded complete MMPI records for 104 inpatients of the Western Psychiatric Institute and Clinic, tested between 1944 and 1951, but mostly in the period 1946-47. Of these, 6 were excluded on the basis of major organic involvement, and 4 because they had obtained a “?” raw score in excess of 100. This left 94 cases with usable MMPI records.

As the reasons for the original test referrals were in many cases unknown, these 94 patients could not be regarded as an unselected sample of the hospital population at that time. However, since any bias in this sample presumably would not affect the range of variation of either the predictors or the criterion used, the possible unrepresentativeness of the sample was not regarded as a limitation.

Feldman (1951) listed several criteria which have to be met before his Ps scale can be validly employed. Of the 94 cases, 60 met all these requirements, while the other 34 failed to meet only the requirement of elevated scores on one or more of the critical scales. These groups will subsequently be referred to as the “elevated” and “unelevated” groups, respectively.

The following dichotomous “criterion of improvement” was used in this study. “Improved” cases were all those who obtained an “improved” rating on leaving hospital and who were not readmitted for psychiatric reasons to this or to another hospital in the 5 years following discharge. “Unimproved” cases were those rated “unimproved” on leaving hospital (usually for transfer to another institution), as well as those cases with an improved rating upon leaving hospital, but with a record of a subsequent relapse severe enough to require psychiatric rehospitalization in the 5 years following discharge. (The shortest period of such rehospitalization was 3 months.) The information on rehospitalization of former patients had been gathered at 6-month intervals from the referring hospitals and physicians, and is complete, so far as is known, except for some patients who may have been hospitalized in another state without mentioning their previous hospitalization here.

Of the 60 patients in the elevated group 23 had received either or both insulin and electric shock treatment. This treatment started from one day to several months after the administration of the MMPI. The 11 patients subsequently classified as improved in terms of the criterion used received an average of 11 electrically induced convulsions and 8 insulin comas each. The corresponding figures for the unimproved group of 12 patients were 14 and 16. The remaining 37 patients in the elevated group received general supportive hospital care, which often included an appreciable amount of individual psychotherapy. The composition of the various subgroups in terms of diagnosis, sex, and age, is shown in Table 1.


PROCEDURE AND RESULTS 


Attempting to determine what is measured by the Ps scale, Feldman (1951) has pointed out that an unusually large number of items which refer to interpersonal relationships are included in his Ps scale, and that his “recovered” group, although consisting almost entirely of psychotic patients, obtained scores very similar to that of a group of normal subjects. Since quality of interpersonal relationships enters into most concepts of adjustment, it may be asked whether an “adjustment” questionnaire might not differentiate equally well between patients that improve and those that do not. Furthermore, since 38 out of the 52 items comprising the Ps scale tended to be answered “True” by Feldman's unimproved criterion group, there may also be a question about the role of response set, specifically of “acquiescence,” and about the possibility of using an acquiescence measure as a suppressor variable in predicting improvement. As scores on a 142-item Adjustment key for the MMPI (Fulkerson, 1957) as well as on a 24-item Acquiescence key, also derived from the MMPI (Fulkerson, 1958) were available for the patients in this sample, these questions could be followed up at small extra cost.


TABLE 1

COMPOSITION OF SAMPLE

                                                Diagnosis and Sex
                              Age         Affective    Schizo-      Nonpsy-
                                          Disorder     phreniaᵃ     chosisᵇ
Subgroup               N    Mean   SD     M    F       M    F       M    F
Elevated, shock treatment:
  Improved            11    35.0   8.8    1    3       1    1       3    2
  Unimproved          12    35.2   7.0    2    0       5    2       1    2
Elevated, supportive care:
  Improved            20    33.4  11.3    1    1       2    3       4    9
  Unimproved          17    29.9  10.9    1    3       1    3       3    6
Unelevated, all treatments:
  Improved            19    36.3  10.7    3    2       2    3       5    4
  Unimproved          15    35.5  10.1    3    2       1    4       3    2


As a first step, the total elevated group was split into improved and unimproved cases (according to the criterion described) and the Ps, Adjustment, and Acquiescence scores were compared. The results in Table 2 show significant differences between improved and unimproved patients in the case of all three variables.


TABLE 2

QUESTIONNAIRE SCORES OF THE TOTAL ELEVATED GROUP

Questionnaire    Criterion Status    N    Mean    SD      t         r bis
Ps               Improved           31    21.2    8.4
                 Unimproved         29    27.2    8.9     2.68**    .41
Adjustment       Improved           31    55.9
                 Unimproved         29    66.8   16.4
Acquiescence     Improved           31
                 Unimproved         29    14.0    3.9     2.73*     .42

* p < .01.
** p < .005 (one-tailed t tests in the case of Ps and Adjustment).


Although the different follow-up criteria used precluded a strict comparison of the Ps distributions obtained in the present investigation with those obtained by Feldman (1951), some interesting similarities did appear. The mean Ps score of the improved group in this study is about halfway between the means reported for Feldman's recovered and improved test groups, while the present unimproved group obtained a slightly lower (more favorable) mean score than his unimproved group. The overlap between the
Ps score distributions of the improved and 
unimproved groups in the present study is 
somewhat greater than that found by Feld- 
man. Inspection reveals that a cutting score 
of 24.5 will maximize the total correct 
predictions of the criterion in the present 
sample. The analogous cutting scores for 
Feldman’s distributions (maximizing total 
correct prediction of his criterion) range be- 
tween 20.5 and 28.5, depending on how the 
dichotomy is obtained. 
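
The search for a cutting score by inspection can be sketched in Python as follows. The sketch assumes integer scale scores, that lower scores are the favorable direction (as stated for the Ps scale), and that candidate cuts lie halfway between adjacent integers; the function name and the scores are hypothetical and are not the study's data.

```python
def best_cutting_score(improved, unimproved):
    """Return (cut, hits): the half-point cut that maximizes correct predictions.

    A patient scoring below the cut is predicted "improved"; a patient
    scoring above it is predicted "unimproved". Scores are assumed to be
    integers, so no score can fall exactly on a half-point cut.
    """
    all_scores = list(improved) + list(unimproved)
    if not all_scores:
        raise ValueError("need at least one score")
    # Candidate cuts: just below the lowest score, and just above each score.
    candidates = sorted({s + 0.5 for s in all_scores} | {min(all_scores) - 0.5})
    best_cut, best_hits = None, -1
    for cut in candidates:
        hits = sum(1 for s in improved if s < cut)
        hits += sum(1 for s in unimproved if s > cut)
        if hits > best_hits:  # ties keep the lowest cut
            best_cut, best_hits = cut, hits
    return best_cut, best_hits


# Hypothetical scores (illustration only).
improved_scores = [12, 15, 18, 21, 23, 24]
unimproved_scores = [20, 25, 27, 30, 33, 36]
cut, hits = best_cutting_score(improved_scores, unimproved_scores)
```

A cut chosen this way is fitted to the sample at hand, which is why the text compares it with the range of analogous cuts in Feldman's distributions rather than treating it as fixed.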

The intercorrelations between the Ps, Adjustment,
and Acquiescence scores of the
60 patients in the elevated group were:
Ps-Adjustment, r = .84; Ps-Acquiescence, r
= .65; and Adjustment-Acquiescence, r =
.53.
The Ps-Adjustment correlation coefficient 
is of the same order as the corrected split- 
half reliability coefficient of .86 reported 
for the Ps by Feldman (1951). With only 
12 items common to these two scales, which 
were derived under widely different condi- 
tions, such a high correlation is remarkable. 
The high acquiescence loadings (if we assume 
for the moment the validity of the Acqui- 
escence scale used) of both the Ps and the 
Adjustment scales are according to expecta- 
tion. Unfortunately, the fact that the 
Acquiescence scale discriminates as well as 
do the other two between the improved and 
unimproved patients, rules out the possibility 
of using it as a suppressor variable. 

It remained to be seen whether prediction
of relapse was equally effective for the group 
who had received general supportive hospital 
care as for the patients who had received 
insulin and/or electric shock treatment. 
Splitting the elevated group into supportive 
care and shock treatment subgroups, it was 
found that the Ps scale discriminated equally 
well (p < .025) between improved and un- 
improved patients in both treatment sub- 
groups. This suggests that what the Ps 
scale measures is not so much ability to 
benefit from shock treatment as the tendency 
to get well irrespective of type of treatment. 
Incidentally, the shock treatment subgroup
obtained a slightly lower (more favorable)
mean Ps score (.1 > p > .05) than the supportive
care subgroup.

A practical limitation of Feldman’s scale 
is that it can only be used with patients 
whose MMPI profiles meet the stated re- 
quirements. In the total sample of 94 pa- 
tients available for this study, 34 (or 36%), 
are thus excluded from consideration. It 
seemed worthwhile to investigate whether the 
relationship observed between Ps scores and 
improvement in the elevated group would 
not be found in the unelevated group as 
well. Comparison of the Ps scores of the
improved and unimproved patients in the
unelevated subgroup yielded no significant
difference, however; neither did the Adjustment
or Acquiescence scores differ significantly,
although all three differences were in
the same direction as for the elevated group.
To estimate the influence of truncation of 
scores on the validity of the Ps scale, biserial 
r was calculated and corrected for restriction 
of range. The r obtained was .22 and it rose 
to .31 (p> .05) after correction. 
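
The text does not say which correction for restriction of range was applied. As a hedged illustration, the Python sketch below uses the common formula for direct restriction on the predictor (often called Thorndike's Case 2), in which the observed correlation is adjusted by the ratio of the unrestricted to the restricted standard deviation. The function name and the ratio shown are assumptions; the standard deviations actually used are not reported.

```python
import math


def correct_for_range_restriction(r, sd_ratio):
    """Correct r for direct range restriction on the predictor.

    sd_ratio is the unrestricted SD divided by the restricted SD.
    Formula: r * k / sqrt(1 - r**2 + r**2 * k**2), with k = sd_ratio.
    """
    if not -1.0 < r < 1.0:
        raise ValueError("r must lie strictly between -1 and 1")
    if sd_ratio <= 0:
        raise ValueError("sd_ratio must be positive")
    return r * sd_ratio / math.sqrt(1.0 - r * r + r * r * sd_ratio * sd_ratio)


# With no restriction (ratio of 1) the correlation is returned unchanged.
unchanged = correct_for_range_restriction(0.22, 1.0)

# A hypothetical ratio of about 1.45 would carry .22 to roughly .31;
# this ratio is inferred for illustration and is not given in the text.
corrected = correct_for_range_restriction(0.22, 1.446)
```

Under this formula the correction grows with the SD ratio, so a modest observed correlation in a truncated group can understate the relationship in the full range of scores.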

Feldman has speculated about the type 
of patient who is ill enough to be in a 
psychiatric hospital, yet responds essentially 
like a normal person to psychological ques- 
tionnaires. Two incidental but interesting 
observations on the unelevated subgroup in 
this study should be reported: Of 8 patients 
diagnosed as “psychopathic personality” or 
“psychopathic state” in the original sample 
of 104 cases, 5 were in the unelevated sub- 
group. Secondly, 12 patients out of the 34 
in the unelevated subgroup attended college, 
and 8 of them graduated. The corresponding 
numbers for the elevated subgroup (N = 60), 
were 9 and 2. These differences, tested by
χ², with Yates' correction, were significant
at the .05 and .01 level for college attendance 
and graduation, respectively. 
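
The two tests can be retraced from the counts given in the text with a short Python sketch. It assumes each comparison is a 2 x 2 table of subgroup (unelevated, N = 34; elevated, N = 60) against the attribute (present or absent) and applies the usual continuity-corrected formula; the function name is hypothetical.

```python
def yates_chi_square(a, b, c, d):
    """Chi-square with Yates' correction for the 2 x 2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column total must be nonzero")
    # Continuity correction: shrink |ad - bc| by n/2, not letting it go below 0.
    diff = max(abs(a * d - b * c) - n / 2.0, 0.0)
    return n * diff ** 2 / (row1 * row2 * col1 * col2)


# College attendance: 12 of 34 unelevated patients, 9 of 60 elevated patients.
chi_attendance = yates_chi_square(12, 34 - 12, 9, 60 - 9)

# College graduation: 8 of 34 unelevated patients, 2 of 60 elevated patients.
chi_graduation = yates_chi_square(8, 34 - 8, 2, 60 - 2)

# Critical values of chi-square with 1 degree of freedom.
CRITICAL_05 = 3.841
CRITICAL_01 = 6.635
```

On these counts the attendance statistic comes to about 4.05 and the graduation statistic to about 7.31, which fall above the .05 and .01 critical values respectively, in agreement with the levels reported.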


DISCUSSION 


Although a significant difference between
improved and unimproved patients was
found, the differentiation in terms of Ps
scores was not marked enough to enable
reliable prediction for the individual patient
to be made except in a small minority of













extreme cases. In evaluating the less im- 
pressive discrimination obtained—as com- 
pared with the data reported by Feldman 
(1951)—the difference between the improve- 
ment criteria used in the two studies, as well 
as the long period that elapsed between 
taking the MMPI and starting shock therapy 
in the case of some patients in this study, 
should be taken into account. Whether this 
latter point made any real difference is 
questionable, as the prediction was shown to 
hold irrespective of treatment received. It 
does suggest that the attribute(s) tapped are 
reasonably stable in time. 

Feldman (1951), discussing his own cross-validation findings, has suggested that the Ps scale measures “propensity to improve” irrespective of diagnosis or method of treatment. The present findings, that the Ps scale predicts improvement equally well for shock treatment and for supportive care patients, and that an Adjustment scale, developed for a different purpose, can predict as well as the Ps scale, do seem to confirm the notion that a general characteristic or group of characteristics, rather than a specific characteristic, namely, responsiveness to shock treatment, is being measured by these scales. That an acquiescence measure, the items of which were chosen so as not to discriminate between well and poorly adjusted military personnel, predicts improvement so well, indicates that what Feldman has tentatively labeled “propensity to improve” may be a complex entity.





SUMMARY 

In a cross-validation study Feldman’s Ps 
MMPI scale—for the prediction of response 
to shock treatment—was found to discrimi- 
nate significantly between patients who had, 
and did not have, a relapse within 5 years, 
irrespective of treatment. An Adjustment 
scale, developed for military personnel, dif- 
ferentiated equally well between the criterion 
groups. These two scales intercorrelated .84, 
and, respectively, correlated .65 and .53 with 
an Acquiescence scale. The possible use of 
the Acquiescence scale as a suppressor vari- 
able was explored and the implications of 
these data for prognosis of psychiatric 
patients discussed. 


REFERENCES 

Feldman, M. J. A prognostic scale for shock therapy. Psychol. Monogr., 1951, 65(10, Whole No. 327).

Fulkerson, S. C. Adaptability screening of flying personnel: Research on the Minnesota Multiphasic Personality Inventory. USAF Sch. Aviat. Med. Rep., 1957, No. 57-106.

Fulkerson, S. C. An acquiescence key for the MMPI. USAF Sch. Aviat. Med. Rep., 1958, No. 58-71.

Pumroy, D. K., & Kogan, W. S. A validation of measures that predict the efficacy of shock therapy. J. clin. Psychol., 1958, 14, 46-47.

Roberts, J. M. Prognostic factors in electroshock treatment of depressive states: II. The application of specific tests. J. ment. Sci., 1959, 105, 703-713.

Windle, C. Psychological tests in psychopathological prognosis. Psychol. Bull., 1952, 49, 451-482.


(Received March 11, 1960)

















Journal of Consulting Psychology 
1961, Vol. 25, No. 2, 146-148 


THE REPRESENTATION OF PHYSIQUE IN CHILDREN’S 
FIGURE DRAWINGS 


A. B. SILVERSTEIN ! 


Pacific State Hospital 


The interpretation of human figure draw- 
ings as a projective technique is said to rest 
on the assumption that the drawn figure 
represents the subject’s body image—‘the 
picture of his body which he forms in his 
mind” (Schilder, 1950). Much of the re- 
search purporting to test the “body image’”’ 
hypothesis has made use of physically dis- 
abled subjects; investigations of this kind 
were reviewed in a previous paper (Silver- 
stein & Robinson, 1956). A search of the 
literature has revealed but little work based 
on subjects within the normal range of 
physical variation. Berman & Laffal (1953) 
reported that the predominant somatotype 
of drawn figures was related to that of the 
men who drew them;? and Kotkov and 
Goodman (1953) found that figures drawn 
by obese women tended to cover a greater 
area of the page than those drawn by women 
of ideal weight. 

In a review of empirical evidence on 
figure drawings, Swensen (1957) questioned 
the treatment and interpretation of the data 
of both of these studies, but even if their 
seemingly positive results are accepted at 
face value, it should be noted that neither 
study provided a direct test of the body 
image hypothesis. The same is true of investi- 
gations of the figure drawings of the physi- 
cally handicapped. When previous research is 
considered from an operational viewpoint, it 
is immediately apparent that its focus has 
been the relation between the drawn figure 

1 Formerly at the Psychiatric Institute, University 
of Maryland, where the data for this study 
collected. 

2 As reanalyzed by the present authors, the data 
presented by Berman and Laffal do not show a 
significant relation between the somatotype of the 
drawn figures and that of the subjects (x? = 6.53, 
df=4, p> 05). 


were 


AND H. A 


ROBINSON 


Psychiatric Institute, University of Maryland 


and the actual structure of the body, not the 
body image. Subjects have been selected not 
on the basis of differences in body image, but 
because they differed with respect to actual 
physique.* While it may be true that nor- 
mally there is no discrepancy between the 
body image and the body structure there 
seems to be no empirical evidence at present 
to support this common assumption. To the 
extent that the subject’s “mental picture” 
of his physique does not correspond to his 
actual physique, the relations and differences 
observed in previous research on the body 
image hypothesis are clearly in error. 

To the writers’ knowledge, the study reported
here is the first to distinguish operationally
between body image and body
structure. The method made it possible to
assess the degree of correspondence between
body image and body structure, and to perform
a direct test of the body image hypothesis,
i.e., to relate the drawn figure to
an independent measure of body image. Since 
Buck (1948) and others have suggested that 
the drawn figure may represent the subject’s 
“body ideal” as well as or instead of his 
body image, a measure of this construct was 
also included in the study. 


PROCEDURE


The subjects were 30 boys and 30 girls, selected
from a total sample of 97 sixth grade public school
children so as to equate for age (mean 11-7, range
11-2 to 12-2). The conventional drawing procedure
was followed during a regular class session. The
children were first asked to draw a person, a whole
person, and then a person of the opposite sex
from the first. No time limits were set for the
task, and no further instructions were given.

When the drawings had been completed, a brief
questionnaire was administered. To obtain measures
of body image which would be experimentally
independent of measures of body structure, the
children were asked to estimate their height and weight.
For measures of body ideal, they were asked to
state how tall they would like to be and how
much they would like to weigh (at the present
time) if their height and weight could be changed.
Finally, the children were weighed and their heights
measured.

3 Silverstein and Klee (1958) conducted an
experimental test of the body image hypothesis in
which body structure was not the basis for selecting
subjects, but in this study, too, no independent
measure of body image was employed.


The height of each of the drawn figures was
measured to the nearest 0.1 inch; and an estimate
of its volume (the height of the figure multiplied
by the square of its width at the waistline, also
measured to the nearest 0.1 inch) was taken to
represent its “weight.”

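
The volume estimate described above can be sketched in a few lines of Python. This is only an illustration of the stated definition (figure height multiplied by the square of waistline width, both in inches). The function name `figure_weight_index` and the sample measurements are hypothetical and are not taken from the study.

```python
def figure_weight_index(height_in: float, waist_width_in: float) -> float:
    """Volume-like index for a drawn figure: height times waist width squared.

    Both arguments are assumed to be in inches, measured to the nearest 0.1.
    """
    if height_in < 0 or waist_width_in < 0:
        raise ValueError("measurements must be non-negative")
    return height_in * waist_width_in ** 2


# Hypothetical drawing: 6.0 inches tall, 1.5 inches wide at the waistline.
example_index = figure_weight_index(6.0, 1.5)
```

Because width enters as a square, a wide figure receives a much larger index than a tall, narrow figure of the same area on the page.
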
RESULTS 


The first step in analyzing the data was to 
assess the degree of correspondence between 
body image and body structure. For this 
purpose, Pearson product-moment correla- 
tions were calculated between estimated and 
actual measures, and the significance of the 
differences between the means of these measures
was evaluated using the t test for correlated
measures. The findings are shown in
Table 1.⁴
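
The two statistics named in this paragraph can be sketched as follows. This is a minimal illustration, not the authors' computation: the helper names `pearson_r` and `paired_t` and the four pairs of heights are hypothetical, and only the Python standard library is used.

```python
from math import sqrt
from statistics import mean, stdev


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def paired_t(x, y):
    """t for correlated (paired) measures: mean difference over its standard error."""
    diffs = [a - b for a, b in zip(x, y)]
    return mean(diffs) / (stdev(diffs) / sqrt(len(diffs)))


# Hypothetical estimated and actual heights (inches) for four children.
estimated = [58, 57, 60, 59]
actual = [59, 59, 61, 61]
r = pearson_r(estimated, actual)
t = paired_t(estimated, actual)  # negative t means the estimates run low
```

A high correlation together with a significant negative t is the pattern reported in the text: the estimates order the children much as the true values do, while falling below them on average.
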


TABLE 1
CORRELATIONS AND DIFFERENCES BETWEEN
ESTIMATED AND ACTUAL MEASURES
(N = 60)

                       Estimated         Actual
Measure              Mean     SD      Mean     SD           r        t
Height in inches     58.3    3.48     59.4    [illegible]   .86**    4.57**
Weight in pounds     92.0   16.37     94.1    18.56         .95**    2.74*

* Significant at .01 level
** Significant at .001 level


The magnitude of the correlation coeffi- 
cients indicates a rather close correspondence 
between body image and body structure as 
far as height and weight are concerned, the 
correspondence being particularly close in the 
case of weight. Since the correlations are not 
perfect, however, there is at least the pos- 
sibility that estimated and actual measures 
may be differentially related to measures of 

4 Data for boys and girls were initially analyzed
separately, but results for the two groups were
virtually identical, and so they have been combined
in the interest of economy of presentation.

TABLE 2
CORRELATIONS BETWEEN ESTIMATED, ACTUAL, AND
IDEAL MEASURES, AND MEASURES OF SAME-SEX
AND OPPOSITE-SEX FIGURES
(N = 60)

Figure             Estimated      Actual     Ideal
Height
  Same-Sex         33**           22         [illegible]
  Opposite-Sex     21             11         04
Weight
  Same-Sex         [illegible]    23         19
  Opposite-Sex     13             12         18

* Significant at .05 level
** Significant at .01 level

the drawn figures (McCornack, 1956). The 


data of Table 1 reveal a significant tendency 
for the children to underestimate their height 
and weight. The most parsimonious interpre- 
tation of this finding appears to be that in 
this period of rapid growth, the children’s 
knowledge of their height and weight is soon 
outdated. 

The correlations between estimated, actual,
and ideal measures, on the one hand, and
measures of the same-sex and opposite-sex
figures, on the other, are given in Table 2.
The two coefficients which reach the conventional
criteria of statistical significance represent
correlations between estimated height
and weight, and the estimated height and
weight of the same-sex figure. Contrary to
expectations, however, both of these coefficients
are negative, a finding which at face
value suggests an inverse relation between
body image and the drawn human figure!



DISCUSSION 


It is not at all clear why the subject’s 
mental picture of his physique should be 
inversely related to the physique of his drawn 
figure. We are reluctant to invoke such
concepts as compensation, reaction formation,
or contrast projection, for there seems
to be no theoretical basis for attempting such 
“dynamic” interpretations of the present re- 
sults; nor do we possess additional informa- 
tion on the subjects of this study which 
might provide independent support for interpretations
of this kind. Under these circumstances,
we prefer to offer no ad hoc
explanation for the findings that estimated
measures proved to be somewhat better predictors
than did actual measures, and that
the relations observed held for same-sex but
not for opposite-sex figures. Whatever the
interpretations of the data, it is clear that
they are not consistent with a body image
hypothesis which calls for a direct representation
of the body image in human figure
drawings.



SUMMARY AND CONCLUSIONS 


Human figure drawings were obtained 
from 60 sixth grade boys and girls, after 
which they were given a questionnaire de- 
signed to provide measures of body image 
and body ideal. Finally, the children were 
weighed and measured. Estimated height and 
weight were highly correlated with actual 
height and weight, indicating a close corre- 
spondence between body image and body 
structure. When actual, estimated, and ideal 
measures were correlated with corresponding 
measures of the drawn figures, small but 
significant negative correlations were obtained
between estimated measures and
measures of the figures. None of the other 


A. B. Silverstein and H. A. Robinson 


correlations was significant. Without more 


data than are presently available, these find- 
ings are difficult to interpret, but in any 
case they are not consistent with the assump- 


tion that the drawn figure directly represents 
the subject’s body image. 


REFERENCES 


Berman, S., & Laffal, J. Body type and figure drawing.
J. clin. Psychol., 1953, 9, 368-370.

Buck, J. N. The H-T-P technique: A qualitative
and quantitative scoring manual. J. clin. Psychol.,
1948, 4, 317-396.

Kotkov, B., & Goodman, M. The Draw-a-Person
tests of obese women. J. clin. Psychol., 1953, 9,
362-364.

McCornack, R. L. A criticism of studies comparing
item-weighting methods. J. appl. Psychol., 1956,
40, 343-344.

Schilder, P. The image and appearance of the
human body. New York: International Univer.
Press, 1950.

Silverstein, A. B., & Klee, G. D. A psychopharmacological
test of the “body image” hypothesis.
J. nerv. ment. Dis., 1958, 127, 323-329.

Silverstein, A. B., & Robinson, H. A. The representation
of orthopedic disability in children’s
figure drawings. J. consult. Psychol., 1956, 20,
333-341.

Swensen, C. H., Jr. Empirical evaluations of human
figure drawings. Psychol. Bull., 1957, 54, 431-466.

(Received March 11, 1960)





Journal of Consulting Psychology
1961, Vol. 25, No. 2, 149-154 


INDEPENDENCE TRAINING AND FIRST GRADERS’ 
ACHIEVEMENT 


JUNE ELIZABETH CHANCE 


University of Missouri 


Psychological research regarding parental
attitudes and child rearing patterns has been
mainly concerned with their general effects on
personality development of the child. Studies
of specific effects of parent-child interaction
on the child’s academic achievement are rarer,
although one might suppose that if variations
in parent attitudes and practices produce
variation in child personality, these
variations would be reflected in and might
in part account for variations in school
achievement.

Results of existing studies of influence of
parental variables on school performance
suggest, but not unequivocally, that deviations
in parent-child relationships are related
to deviations in school achievement (Hattwick
& Stowell, 1936; Kurtz & Swenson,
1951; Levy, 1933, 1943). These studies,
however, are based upon situations where
there are extremes in either child or parent
behavior and reveal little about what might
be true of more typical situations.

Hypotheses offered by McClelland (1958)
regarding the role of early learning in
formation of the achievement motive suggest
relationships between certain facets of
parental behavior and the child’s motivation
toward his school performance. Winterbottom
(1958), in a study within the McClelland
framework, found that earlier demands by
mothers for independence behaviors were
related to higher need achievement in
8-year-old boys. She did not, however, find
differences between her groups in actual
school achievement as assessed by teacher's
ratings. Her study, focused as it was on
achievement fantasy, did not provide necessary
controls of the other pertinent variables
which might affect actual achievement.

In a recent study of mothers of deaf
children, Gordon (1959) explored the relation
of mothers’ independence training attitudes
to disparities between the children’s
intellectual ability and the extent to which
they had actually accomplished certain
developmental tasks. He found that mothers
of high potential-accomplishment disparity
deaf children favored earlier independence
training more than did mothers of low
disparity children.



d’Heurle, Mellinger, and Haggard (1959) 
in a study of personality, intellectual, and 
achievement patterns of gifted third grade 
children found small positive correlations be- 
tween overprotectiveness of parents and 
arithmetic, reading, and general achievement 
scores. They also found a positive relation- 
ship between parental pressures toward 
achievement and achievement test scores. 

The present investigation asked whether,
within a group of first grade children who
were neither retarded academically nor disturbed
emotionally, individual differences in
school progress were related to differences in
mothers’ attitudes toward independence training.
The study was performed as part of a
larger assessment program of first grade 
children. Medical and psychiatric data, as
well as psychological and achievement test
results, were available for each child. School
records, the supervisor of instruction, teachers,
and the physician supplied general information
about and impressions of home
situations. In addition, for the children of the


1 These data were collected while the author was 
at the University of North Carolina. The author 
wishes to express her appreciation to Bernice Wade, 
Supervisor of Instruction for the Chapel Hill Public 
Schools, to the teachers, and to Kempton Jones 
and John Filley whose cooperation made this study 
possible. 




total sample who are included in the present 
investigation, a questionnaire filled out by 
the mother regarding her age expectations for 
achievement of independence behaviors was
obtained. The Winterbottom (1958) ques- 
tionnaire was adapted for this purpose. 


METHOD 


The children in the first grade class at School A
numbered 13 girls and 17 boys; the children at
School B, 19 girls and 16 boys. Only children entering
first grade for the first time were included in
the study. Of the 65 children available for study,
37 were from homes where one or both parents
were engaged in occupations related to the local
university. Parents of the remaining children were
engaged in occupations ranging from professions
to owners or managers of small businesses and
skilled workers.

All children were administered the Revised
Stanford Binet Scale, Form L, between December 1
and April 1 of the school year. In May of that
year, both classes were given the Stanford Achievement
Test, Primary Battery, Form N. The psychiatrist
interviewed each teacher about each child
in her class in order to arrive at a psychiatric
evaluation. The physician compiled available data
to obtain a picture of the medical history and current
physical status of each child. Each teacher
described all of her children by means of a behavioral
checklist. In April, the present investigator
mailed to each mother of the children being studied
a brief questionnaire composed of some identifying
questions, the 20 items of Winterbottom’s independence
training questionnaire, and 8 other items of
a similar type devised by the investigator. A covering
letter told mothers they had been selected as a
sample to represent mothers of school-age children
in the community. Instructions emphasized confidentiality
and research use of the data. Mothers
were asked to return the questionnaire even if they
felt they could not answer every item. They were
asked to indicate for each item of the questionnaire
the approximate age at which they expected their
children to do what the item described and also
to check the items they felt were especially important
goals of their child rearing.²

The covering letter invited mothers to phone the
investigator if they needed further clarification of
the questionnaire. Of the five mothers who called,
in only one instance did there seem to be genuine
confusion about the questionnaire itself. The others
seemed merely curious or in need of reassurance.
Of 65 questionnaires sent, 52 were returned. The
results reported here are based on data from these
52 complete cases.



2 Verbatim copies of the instructions and covering
letter are available from the author on request.




TABLE 1

MEANS AND STANDARD DEVIATIONS OF INTELLIGENCE,
ACHIEVEMENT, AND INDEPENDENCE TRAINING
MEASURES FOR ALL SUBJECTS

Measure

Intelligence Quotient
  (Revised Stanford Binet, Form L)

Reading Achievement
  (Stanford Achievement Test, Primary Battery, Form N)

Arithmetic Achievement
  (Stanford Achievement Test, Primary Battery, Form N)

Revised Independence Training Questionnaire


TREATMENT OF DATA AND RESULTS 


Table 1 presents a summary of means and 
standard deviations of the measures available 
for each of the 52 subjects. No child in this 
group had any marked physical or sensory 
handicap, nor were any of these children 
from homes broken by death or divorce. 
Observation and psychiatric evaluation sug- 
gested that 15 children in the total group 


of 65 showed some mild degree of personality 


disturbance. Two of these children are in- 
cluded in the 52 of this study—both appear 
in the early independence training subgroup. 

Examination of Table 1 reveals that the
group studied was superior in both intellectual
ability and school achievement. Tests
of mean differences in intelligence scores,
reading and arithmetic achievement scores,
and independence training scores indicated
no significant differences between children in
the two different schools; hence the two
school groups are combined in Table 1 and
in all further analyses of the data.

Since 8 a priori items had been added to
the 20 items of Winterbottom’s independence
training questionnaire, an item analysis of the
whole questionnaire was performed. Mothers
were divided into groups favorable to early or
late training on the basis of their average age
of demand scores for the entire questionnaire,
i.e., the sum of all ages given divided by the
number of items answered. (Not all mothers
answered all items; all answered at least 23
of the 28.) Age estimates for each item given





TABLE 2

RANGES AND MEDIANS OF AGE ESTIMATES GIVEN BY ALL MOTHERS TO ITEMS OF THE INDEPENDENCE
TRAINING QUESTIONNAIRE AND SIGNIFICANCES OF DIFFERENCES IN RESPONSE BETWEEN MOTHERS
FAVORABLE TO EARLY AND THOSE FAVORABLE TO LATE TRAINING

Item                                        Range of Age    Median Age    Value(a) of
                                            Estimates       Estimate      χ²

 1. To stand up for his own rights with other children
 2. To know his way around his part of the community so he can play where he wants without getting lost
 3. To go outside to play when he wants to be noisy or boisterous
 4. To be willing to try new things on his own without depending on his mother for help
 5. To be active and energetic in climbing, jumping, and sports
 6. To show pride in his own ability to do things well
 7. To take part in his parents’ interests and conversations
 8. To try hard things for himself without asking for help
 9. To be able to eat alone without help in cutting and handling food    8.39***
10. To be able to lead other children and to be able to assert himself in children’s groups    16.48****
11. To make his own friends among children of his age    17.73****
12. To hang up his own clothes and to look after his own possessions    3 31
13. To do well in school on his own    16
14. To be able to undress and to go to bed by himself    1.30
15. To have interests and hobbies of his own: be able to entertain himself    9.62***
16. To earn his own spending money    2 1.80
17. To do some regular tasks around the house    2.84*
18. To be able to stay at home during the day alone    5 8.39***
19. To make for himself decisions like choosing his clothes or how to spend money for toys, hobbies, recreations, etc.    3 13.20****









TABLE 2 (Continued)

Item                                                  Range of Age    Median Age    Value(a) of
                                                      Estimates       Estimate      χ²

20. To do well in competition with other children;        1-10            6         [illegible]
    to try hard to come out on top in games and sports
21. To be satisfied to stay with someone he knows         1-6             3         09
    well when parents must be away for a few days
22. To decide upon and to purchase small gifts with       3-10            7         10.48***
    his own money for family members and close friends
23. To hold short conversations with grown-up friends     4-10            5         4.63**
    who come to visit the family
24. To visit and to stay overnight with a playmate        5-9             7         89
25. To straighten out most of his difficulties with       4-10            6         3.37*
    other children without adult intervention
26. To be interested in obtaining good grades in school   6-12            7         3.96**
27. To take part in group activities such as clubs,       6-11            8         5.48**
    scouts, etc.
28. To read a simple story or comics by himself           6-8             7         14

(a) All values of chi square were computed with Yates correction for continuity.
* p < .10.
** p < .05.
*** p < .01.
**** p < .001.

by mothers favorable to early training were
then classified as above or below the median
age estimate given by all mothers for that
item. Age estimates given by mothers favorable
to late training were similarly classified.
Significances of differences in responses to
items by the two groups of mothers were
tested by means of 2 x 2 chi squares. The
complete list of the independence training
questionnaire items is given in Table 2, along
with ranges and median age estimates for the
total group of mothers. Table 2 also contains
the values of chi square and the significance
levels of those items answered
differently by mothers favorable to early
independence training and by those favorable
to late. Rescoring each mother’s questionnaire
using only the responses given to the
19 items showing significant differentiation
between the two groups of mothers, a revised
independence training score (RIT) was obtained.
The RIT scores summarized in
Table 1 are average age of demand scores,
i.e., the sum of estimates given divided by
the number out of the 19 items responded to.
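
The item analysis rests on a 2 x 2 chi square with Yates correction for continuity. A minimal sketch of that statistic follows. The function name `yates_chi_square` and the cell counts are hypothetical (no cell frequencies are reported in the text); the formula itself is the standard continuity-corrected chi square for a fourfold table.

```python
def yates_chi_square(a: int, b: int, c: int, d: int) -> float:
    """Chi square with Yates correction for the 2 x 2 table [[a, b], [c, d]].

    Rows: early-training vs. late-training mothers.
    Columns: age estimate above vs. below the item median.
    """
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column total must be positive")
    # The correction subtracts n/2 from |ad - bc|, never going below zero.
    corrected = max(abs(a * d - b * c) - n / 2, 0.0)
    return n * corrected ** 2 / (row1 * row2 * col1 * col2)


# Hypothetical item: of 26 early-training mothers, 6 gave an age above the
# median; of 26 late-training mothers, 20 did.
chi2 = yates_chi_square(6, 20, 20, 6)
```

With one degree of freedom, a value of this size would fall beyond the .001 level, the same order as the largest entries in Table 2.
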

In order to investigate the effects of independence
training on achievement it was
necessary to hold constant differences in
intellectual ability. To do this, the distributions
of intelligence scores and of reading
and arithmetic achievement scores were converted
into ranks and thence into standard
rank scores. Using these converted scores, a
disparity score was computed for each child
in reading and in arithmetic. That is, the
difference between his ranking in the total
group on the basis of each of his achievement
scores and his ranking in the group on the
intelligence measure was obtained. If a
child’s intelligence rank and achievement
rank were equal, his score would be zero.
If his intelligence rank exceeded his achievement
rank, his score would be negative, or
if his achievement rank exceeded his intelligence
rank, his score would be positive. In
order to remove the negative scores, a constant
of 30 was added to the scores summarized
in Table 3. In Table 3, therefore,
scores below 30 indicate achievement rank
below intelligence rank, while scores above 30
indicate the converse.
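
The disparity score can be illustrated with a short sketch. One assumption should be stated plainly: the conversion from ranks to "standard rank scores" is not described in the text, so the code below uses plain ranks (1 = lowest, ties averaged) as a stand-in. The function names and the three sample children are hypothetical.

```python
def average_ranks(values):
    """Rank values from 1 (lowest) upward, giving tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of positions i..j, 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def disparity_scores(iq, achievement, constant=30):
    """Achievement rank minus intelligence rank, shifted by a constant."""
    iq_ranks = average_ranks(iq)
    ach_ranks = average_ranks(achievement)
    return [a - q + constant for a, q in zip(ach_ranks, iq_ranks)]


# Hypothetical IQs and reading scores for three children.
scores = disparity_scores([100, 120, 140], [3.0, 2.0, 2.5])
```

In this toy group the first child scores above 30 (achievement rank above intelligence rank) and the other two score below 30, matching the reading of Table 3 given in the text.
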

Subjects were then divided into early and 
late independence training groups on the 
basis of mothers’ scores on the RIT question- 
naire and mean differences in disparity 
scores were tested using Fisher’s t. The
analyses were done first for girls and boys 
separately and then for combined groups. 
The results of these tests appear in Table 3. 
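
A t of this kind can be computed from group means and standard deviations alone. The sketch below uses the pooled-variance form and rests on two assumptions that the text does not confirm: that each subgroup was split evenly into early and late groups (the group sizes are not reported), and that the tabled SDs were computed with N rather than N - 1 in the denominator. The function name `pooled_t_from_summary` is hypothetical.

```python
from math import sqrt


def pooled_t_from_summary(m1, sd1, n1, m2, sd2, n2, sd_uses_n=True):
    """Independent-groups t (pooled variance) from summary statistics.

    If sd_uses_n is True, each SD is treated as sqrt(sum of squares / n);
    otherwise as sqrt(sum of squares / (n - 1)).
    """
    ss1 = sd1 ** 2 * (n1 if sd_uses_n else n1 - 1)
    ss2 = sd2 ** 2 * (n2 if sd_uses_n else n2 - 1)
    pooled_var = (ss1 + ss2) / (n1 + n2 - 2)
    se = sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (m2 - m1) / se


# Reading disparity means and SDs for girls as listed in Table 3,
# with an ASSUMED even split of the 24 girls into 12 early and 12 late.
t_girls_reading = pooled_t_from_summary(20.50, 9.98, 12, 38.50, 9.50, 12)
```

Under these assumptions the result is about 4.33, close to the 4.32 listed for girls' reading in Table 3, but the agreement should be read only as a plausibility check on the assumed group sizes.
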

Differences between early and late inde- 
pendence training groups are statistically 
significant in every case and in a direction 
suggesting that children whose mothers favor 
earlier demands for independence make poorer 
school progress relative to their intelligence 
level than children whose mothers favor later 
independence demands. The differences ap- 
pear, at least superficially, to be more marked 
in girls than in boys and more marked in 
reading than arithmetic. 


TABLE 3

DISPARITY BETWEEN GROUP RANKING IN INTELLIGENCE
SCORES AND RANKING IN ACHIEVEMENT FOR EARLY
AND LATE INDEPENDENCE TRAINING GROUPS

                    Early Group         Late Group
Achievement        Mean      SD        Mean      SD      Value of t
Reading
  Girls (24)       20.50     9.98      38.50     9.50      4.32**
  Boys (28)        25.43     9.43      35.29     8.06      2.81**
  Both (52)        23.15     9.84      36.77     9.07     15.76***
Arithmetic
  Girls (24)       19.33    16.33      37.67     7.39      3.22**
  Boys (28)        26.71    10.52      35.14     4.53      2.60*
  Both (52)        23.31    14.29      36.31     6.69      4.10***

* p < .02.
** p < .01.
*** p < .001.



Tests of the frequency of items checked by 
mothers on the questionnaire as especially important
to them failed to show any significant
relation to other measures. Similarly an hy- 
pothesis that early and late training groups 
might contain differential numbers of chil- 
dren from university-related homes was not 
substantiated. 


DISCUSSION 


While the findings of this study confirm 
those of Gordon (1959) with mothers of 
deaf children, they are contrary to those 
which might be expected from Winterbot- 
tom’s study (1958) and from McClelland’s 
(1958) hypotheses about age of independ- 
ence training and n Achievement. However, 
this study concerns actual achievement, while 
Winterbottom and McClelland are concerned 
with motivation to achieve. The multitude 
of variables which operate and interact to 
produce differences in actual achievement are 
exceedingly complex. However, the fact that 
mothers’ attitudes toward earliness of inde- 
pendence training and actual achievement are 
inversely related in these findings, where
healthy, psychologically sound youngsters³
from a fairly homogeneous social group were
studied, suggests that relations among ma-
ternal attitudes toward independence train- 
ing, n Achievement, and actual achievement 
need to be more extensively investigated. 

If one may generalize from this group of 
“normal” children to children referred to 
clinics for academic difficulties, and if one 
assumes that very early demands for inde- 
pendence behaviors by the mother may be 
experienced by the child as excessive pres- 
sure, the findings here confirm the common 
clinical hypothesis that school failure is a 
means by which the child can express 
resistance to the parent (Vorhaus, 1946). 

Examination of the independence training 
questionnaire from the point of view of what 
other than attitudes toward independence 


3 Reanalysis of the data, excluding the two cases
of “maladjusted” children from the early training 
group, failed to alter the results significantly. One, 
a girl, with an IQ of 146, had a reading disparity 
score of 19 and an arithmetic disparity score of 18; 
the other a boy, with an IQ of 125, had a reading 
disparity score of 36 and an arithmetic disparity 
score of 28. 



















training it might sample suggests that many 
items could be interpreted as related to the 
mother’s need to maintain interpersonal dis- 
tance in her relationship with her child. As 
noted earlier, Kurtz and Swenson (1951) 
found evidence suggesting that parents of 
underachievers might be more distant in their 
relationships with their children than parents 
of overachievers. d'Heurle, Mellinger, and 
Haggard (1959) found both parental over- 
protectiveness and pressures for achievement 
to be positively associated with high achievement.
Taking all this evidence together, the
following hypothesis is suggested: mothers’ 
attitudes toward early independence training 
will differentially influence the child and his 


subsequent school achievement depending 
upon whether she maintains a close or distant 
interpersonal relationship with him. The 


same hypothesis can also be stated thus:
maternal attitudes favoring early independence
will have a different impact upon the child
depending upon the mother’s motivation for
desiring that independence. A possible
theoretical distinction between instrumental act
independence and emotional independence is 
implied. 

The findings of this investigation are 
limited by the selective nature of the situa- 
tion studied: the children’s superior intel- 
lectual ability and small numbers, their 
attendance in a good school system with 
excellent teachers, and their parents’ rela- 
tively high socioeconomic and cultural status. 
A further source of bias is evident in the 
differential return of the mail questionnaires. 
While it is fascinating that of 13 mothers 
who did not return their questionnaires, all 
13 were mothers of children judged by the 
investigators to be somewhat maladjusted, 
it is difficult to see how this bias might have 
influenced the direction of the findings. 


Rather, it seems to define them all the more 
clearly. 




SUMMARY 


The relationship between mothers’ atti- 
tudes toward independence training and 52 
first grade children’s school achievement was 
studied. It was found that children whose 
mothers favored earlier independence train- 
ing made less adequate school progress in 
both reading and arithmetic relative to their 
intellectual ability than children whose 
mothers favored later independence training. 
Implications and limitations of the findings 
are discussed. 


REFERENCES 


d’Heurle, Adma, Mellinger, Jeanne C., & Haggard,
E. A. Personality, intellectual, and achievement
patterns in gifted children. Psychol. Monogr.,
1959, 73(13, Whole No. 483).

Gordon, J. E. Relationships among mothers’ n
achievement, independence training attitudes, and
handicapped children’s performance. J. consult.
Psychol., 1959, 23, 207-21

Hattwick, B. W., & Stowell, M. The relation of
parental overattentiveness to children’s work
habits and social adjustments in kindergarten and
the first six grades of school. J. educ. Res., 1936,
30, 169-176.

Kurtz, J. J., & Swenson, E. J. Factors related to
over-achievement and under-achievement in school.
Sch. Rev., 1951, 59, 472-48

Levy, D. M. Relations of maternal overprotection
to school grades and intelligence tests. Amer. J.
Orthopsychiat., 1933, 3, 26-34.

Levy, D. M. Maternal overprotection. New York:
Columbia Univer. Press, 1943.

McClelland, D. C. The importance of early learning
in the formation of motives. In J. W. Atkinson
(Ed.), Motives in fantasy, action, and society.
Princeton: Van Nostrand, 1958.

Vorhaus, Pauline G. Non-reading as an expression
of resistance. Rorschach Res. Exch., 1946, 10, 60-69.

Winterbottom, Marian R. The relations of need
for achievement to learning experiences in independence
and mastery. In J. W. Atkinson (Ed.),
Motives in fantasy, action, and society. Princeton:
Van Nostrand, 1958.

(Received March 16, 1960)








Journal of Consulting Psychology 
1961, Vol. 25, No. 2, 155-15


MAGICAL THINKING AND ASSOCIATED PSYCHOLOGICAL
REACTIONS TO BLINDNESS¹


STANLEY P. ZARLOCK²


University of Buffalo 


With the onset of blindness, an individual 
becomes aware that he can no longer execute 
certain sensory-motor responses. A reduction 
in a repertoire of responses often leads to a 
state of trauma. Loss of vision hinders ordi- 
nary locomotion and creates a special prob- 
lem of orientation to physical objects. During 
the initial phase of blindness, even eating 
a simple meal may become a complex task. 
Thus, social adaptation and acceptance of a 
physical disability are largely dependent 
upon the acquisition of new skills and habits. 
Among many physically disabled persons, 
however, blindness becomes an intolerable
state of affairs and results in a temporary 
or permanent regression of ego functions. 

The wish for the restoration of vision may 
cause an individual to rely on an earlier 
mode of ego development where the bound- 
aries between reality and irreality lack ade- 
quate distinction. Consistent with Piaget’s 
(1953) description of the “magico-phe- 
nomenalistic” state of ego development, a 
blind person may place unusual confidence 
in miraculous cures and ascribe magical char- 
acteristics to persons and objects that he 
believes may somehow do away with his 
blindness. Hence, a regression of ego func- 
tions may hinder the learning of new and 
appropriate social habits. 

The present investigation was an attempt 
to shed some light on the nature and extent 
of these magical beliefs and to determine the 


psychological characteristics of personality 


1 This paper is a condensation of a dissertation
submitted in partial fulfillment of the requirements
for the PhD degree at the University of Buffalo.
The author is especially indebted to B. Richard
Bugelski for his assistance on the project.

2 Now a staff psychologist at the Veterans
Administration Hospital, Lexington, Kentucky.


which are relevant to the process of social 
adaptation and acceptance of blindness. 

Previous authors have dealt with similar 
problems. In a theoretical paper, Barker, 
Wright, Meyerson, and Gonick (1953) de- 
scribed the psychological reactions of blind 
individuals in terms of Lewin’s (1951) model 
of behavior. These authors stated that blind- 
ness creates a new psychological region in 
which the locations of goals are unknown and 
behavioral routes are unfamiliar. A new 
psychological region, therefore, becomes a 
source of frustration, conflict, and anxiety. 

Blindness also creates a state of helpless- 
ness and dependency. According to Adorno, 
Frenkel-Brunswik, Levinson, and Sanford
(1950), the “anti-democratic personality” 
has dependency needs that involve feelings 
of “doubt, uncertainty, and momentary lack 
of self-confidence.” They illustrated that 
dependency may be associated with “worry 
about the future, realization of impending 
danger, and feeling absolutely lost.” 

Applying psychoanalytic principles to her 
study, Burlingham (1941) reported the case 
histories of two blind children. After close 
examination, she concluded that the loss of 
sight seriously interfered with the function 
of the ego to test reality. These blind chil- 
dren frequently engaged in fantasies in which 
wishes were fulfilled and certain unpleasant 
aspects of reality were denied. 

Similarly, Deutsch (1940), who studied 28
persons born blind, noticed a readiness to
give up reality and to escape into fantasy.
A large proportion of these subjects believed
a cure would come from a supernatural
power. Thus, Deutsch concluded:


This expectation that the cure would come from a 
supernatural power belongs partly to the realm of 
fantasy in which there is room for the fulfillment 
of all wishes (p. 124). 




For the present study, a series of hy- 
potheses dealing with the relation between 
the psychological characteristics of the blind 
and social adaptation were formulated. 
Among them were the following:


1. Social adaptation to a new psycho-
logical region is largely enhanced by ego
strength, low manifest anxiety, and a positive 
attitude towards blindness. 


2. Failure at social adaptation results in 
magical beliefs as to the power of medicine 
and religion in the treatment of illness and 
disability. 


3. An antidemocratic personality is associ- 
ated with failure in social adaptation. 

4. There are no personality differences 
between socially well-adjusted blind subjects 
and physically normal individuals. 


METHOD 
Subjects 


Fifty-two blind subjects from various regions of 
New York State were studied. Each subject was a 
male between 20 and 45 years of age. Four addi-
tional criteria were used in the selection of subjects:
total and permanent blindness with no light per-
ception; at least 10 years of normal vision prior to
blindness; blind for at least 3 years; and, with the
exception of blindness, no other physical disability.

Twenty-five physically normal subjects were 
selected as a control group for the 25 blind subjects 
who had made an adequate social adjustment to 
their handicap. These groups of subjects were 
matched on the variables of age, intelligence, socio-
economic background, and religious affiliation.


Scales 


Based on the research of Barker (1948), Bauman 
(1954), Fitting (1953), Meyerson (1953), and 
Raskin (1953), a Social Adjustment Scale was 
developed as a preliminary step to permit measure- 
ment of social adaptability to blindness. The scale 
was a rating device based on a series of check list 
items which indicated the blind individual’s level 
of social maturity or acquisition of appropriate 
skills in 10 problem areas: employment, travel, 
indoor orientation, socialization, communication, 
recreation, eating problems, dressing problems, busi-
ness problems, and physical hygiene.

In the development of the Social Adjustment 
Scale, care was taken that each problem area was 
a measure of social and not a measure of psycho- 
logical behavior. For instance, some of the items 
were as follows: gainfully employed for one year, 
uses cane or dog guide to travel about the com- 
munity, reads and writes braille, holds membership
in a civic organization, maintains proper etiquette
during meals, and buys his own clothes. 

Each blind subject was rated on the Social Ad-
justment Scale by a social worker most familiar
with him. A second rating was obtained by the
investigator, who interviewed each subject and
members of the family. A reliability coefficient of
correlation between the two sets of scores was .92.
Applying the Spearman-Brown Prophecy Formula,
the correlation was raised to .95.

Personal adaptability, manifest anxiety, and atti-
tudes towards blindness were measured by the
Barron Ego Strength scale (1953), Taylor Manifest 
Anxiety Scale (1953), and the Fitting Attitude 
towards Blindness Scale (1953), respectively. 

Two separate attitude scales were constructed to 
get at the confidence placed in medicine or religion 
to restore sight. Each scale consisted of statements 
which reflected attitudes from extremely negative 
to extremely positive. On the Religious Scale, for 
instance, the statement, “The healing of lepers as 
mentioned in the Bible is a fairy tale,” was rated 
negative towards religion; whereas, the statement, 
“A perfect spiritual faith would absolutely lift us 
from all physical disease,” was considered extremely 
positive. For each of the two scales, 22 statements 
were selected by Thurstone’s (1951) method from 
original lists of 130; each was rated on an 11-point 
scale. 

Both instruments were administered to 80 under- 
graduate college students. The split-half reliability 
coefficient of correlation was .80 for the Medical 
Scale, .93 for the Religious Scale. The Spearman-
Brown Prophecy Formula raised the coefficients to
.92 and .96.
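
The Spearman-Brown step-up applied to a split-half coefficient can be sketched in a few lines. This is a minimal illustration of the standard formula for a test lengthened k-fold (k = 2 for split halves); the function name is illustrative and is not part of the original study. Applied to the Religious Scale's split-half coefficient of .93, it gives the reported .96.

```python
def spearman_brown(r: float, k: float = 2.0) -> float:
    """Predicted reliability when a test with reliability r is lengthened k-fold.

    With k = 2 this is the usual correction of a split-half correlation.
    """
    return k * r / (1.0 + (k - 1.0) * r)


# Split-half coefficient reported in the text for the Religious Scale.
religious_corrected = spearman_brown(0.93)
print(f"{religious_corrected:.2f}")
```

The same function with other values of k estimates the reliability of a test lengthened (or shortened) by any factor.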

The California F Scale, Form 78, devised as a
test of dependency and reliance on authority, was
used to measure “antidemocratic personality.”

An estimate of intelligence was obtained from the 
Vocabulary subtest of the Wechsler-Bellevue scale, 
Form 1 (1944). 

At the end of the psychological test battery, an 
interview supplemented the data obtained from the 
Medical and Religious Scales. During the interview, 
an additional attempt was made to determine the 
attitudes of blind subjects towards “miraculous” 
cures by allowing the subjects to verbalize their 
problems. 




Procedure 


The social worker most familiar with the subject
was asked to rate him on the scale of social adjust-
ment to blindness. The ratings of each social worker
were corroborated by similar ratings obtained from
a member of the family and a close acquaintance.
Because the scale dealt with behavioral items, there
was little disagreement. Scores could range from
0 to 30, with higher scores indicating superior
adjustment.

The sequence of test presentation was always
constant. Test items were read aloud to each blind
subject and the answers tape recorded. The inter-
view was recorded verbatim and then rated by two
sophisticated judges along a five-point scale of
confidence the subject ascribed to medicine and 
religion for bringing about a miraculous cure. A 
rating of 1 indicated a strong confidence and a 
rating of 5, no confidence. The same procedure was 
followed with physically normal subjects except that 
they were not rated for social adaptability. 


RESULTS 

On the Social Adjustment Scale, the scores 
for the 52 blind subjects ranged from a low 
of 3 to a high of 29. The mean score for 
the entire sample was 16.3. The distribution 
of scores approximated a normal probability 
curve. 

As Table 1 indicates, all psychological 
measures correlated significantly (.01 level) 
with scores on the Social Adjustment Scale. 
These correlations support the hypotheses 
that higher ego strength, lower manifest 
anxiety, and a more positive attitude towards 
blindness are important psychological vari- 
ables which are strongly related to a blind 
individual’s ability to make an adequate 
social adjustment. Reliance on the power of 
medicine and religion to restore vision was 
negatively correlated with the blind person’s 
level of social adaptation. Antidemocratic
personality was also negatively related to
social adjustment to blindness. When the
possible effects of intelligence and duration
of blindness were partialed out, all correla-
tions remained significant at the .01 level
with the exception of the one associated
with the Medical Scale, for which the con-
fidence level dropped to .03.


TABLE 1

CORRELATIONS BETWEEN SCORES ON SOCIAL ADJUSTMENT
SCALE AND PSYCHOMETRIC TESTS

Ego Strength Scale
Manifest Anxiety Scale
Attitude towards Blindness
Attitude towards Medicine
Attitude towards Religion
California F Scale

* All significant at .01 level.
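
Partialing a third variable out of a correlation can be illustrated with the standard first-order partial correlation formula. The sketch below is an assumption about the kind of computation involved, not the author's actual procedure, and the input values are hypothetical, since the original correlations are not reproduced here. Removing two variables at once, as in the analysis described above, would apply the same formula a second time to the first-order results.

```python
import math


def partial_corr(r_xy: float, r_xz: float, r_yz: float) -> float:
    """First-order partial correlation of x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1.0 - r_xz**2) * (1.0 - r_yz**2))


# Hypothetical values: x = social adjustment, y = an attitude scale,
# z = intelligence. None of these numbers come from the study.
r_adjusted = partial_corr(r_xy=-0.40, r_xz=0.30, r_yz=-0.35)
print(f"{r_adjusted:.3f}")
```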

A comparison of scores on each of the 
psychological tests indicated that no sig- 
nificant differences existed between the 25 
physically normal individuals and the 25 
blind subjects who scored above average on 
the Social Adjustment Scale. 

As can be seen in Table 2, blind subjects 
who scored below the mean on the Social 
Adjustment Scale were frequently rated from 
their interviews as individuals who rejected 
their blindness and believed in miraculous 
cures. Better adjusted blind subjects tended 
to accept their blindness and rejected the 
idea of a miraculous cure. Physically normal 
subjects tended to be neutral. 

The distribution of scores for both blind
groups as indicated in Table 2 shows the
divergence of observed from expected results
was significant at the .01 level. For the
physically normal subjects, the divergence of
observed from expected results was not
significant at the .05 level.


TABLE 2

RATINGS OF THE ATTITUDES TOWARDS MEDICINE AND RELIGION BASED ON
INTERVIEW PROTOCOLS

Group                       Medicine    Religion
Poorly Adjusted Blind
Well Adjusted Blind
Visually Normal Subjects

Note. Values shown are percentages of ratings by two judges.

To test for similarity among psychological 
tests, a series of intercorrelations were ob- 
tained. These intercorrelations ranged from 
.04 to .59, with the mean correlation at .30. 
The intercorrelations which fell above the 
mean were: F Scale and Medical Scale .59; 
F Scale and Religious Scale .57; and Medical 
Scale with Religious Scale .38. 


DISCUSSION 


An examination of the results clearly 
shows that social adjustment to blindness is 
closely related to ego strength, manifest 
anxiety, and attitudes towards blindness. To 
a large extent, these psychological variables 
appear to determine the kind of personal 
and social adjustment an individual makes 
to his physical disability. On the other hand, 
the psychological characteristics of an indi-
vidual may be a reflection of the amount of
social adjustment he has made towards his 
handicap. 

The role of ego strength in the process of 
adjustment to blindness becomes more mean- 
ingful when examined in the light of somato- 
psychology (Barker et al., 1953). Orientation 
to a new psychological region often requires 
a change in physical and cognitive behavior. 
Likewise, adjustment to blindness generally 
necessitates a new focus on goals and the 
learning of new skills and habits by which 
these goals are attained. Thus, it seems 
reasonable to assume that personal adapt- 
ability and resourcefulness have an important 
bearing on the reaction of an individual to a 
new psychological region. 

The uncertainty of goals and behavioral
routes of a new psychological region arouses
manifest anxiety. On the other hand, the
development of new skills and habits tends 
to abolish the unfamiliarity of a new region 
and thus results in less manifest anxiety. 

When a person perceives his world as 
competitive and hostile, lack of vision may 
arouse in him a negative attitude towards 
blindness. He may feel that a lack of sight 
places him at a serious disadvantage with
other human beings. The person may attempt
to withdraw from the “struggle” and avoid 
competition against unfavorable odds. 

The individual who perceives the world in 
a less threatening sense generally has a more 
favorable attitude towards blindness. The 
physical disability does not place him in a 
dangerous position; hence, there is no need 
to withdraw from social situations. He ac- 
quires new skills and habits which permit 
him to carry out many of his personal and 
social functions. 

As mentioned at the beginning of the 
study, a blind individual may seek an escape 
route by which he avoids the unpleasant 
aspects of the new psychological region. A 
common escape route, for many of these 
individuals, takes the form of magical beliefs 
towards the recovery of vision. The results 
demonstrate that subjects who resisted the 
acceptance of blindness placed unusual con- 
fidence in the ability of medicine and religion 
to perform miraculous cures. Subjects who 
accepted their physical disability expressed 
more realistic attitudes towards medicine and 
religion. 

Finally, attitudes towards authority have 
an important bearing on the adjustment 
process to blindness. Making an inference 
from the work of Adorno et al. (1950), it is 
probable that the blind individual who has 
an antidemocratic personality finds the new 
psychological environment full of dangerous 
elements. The overwhelming threat may 
cause him to lose self-confidence and depend 
heavily on the support of authority as a 
means of survival. The less authoritarian- 
minded person probably does not perceive 
the environment quite as threatening. Since 
extremely dangerous elements are not pres- 
ent, he is able to maintain self-confidence 
and achieve levels of independent behavior. 

In the area of magical thinking, intel- 
ligence had some influence on the formation 
of attitudes towards miraculous cures in 
medicine. However, intelligence had almost 
no effect on attitudes towards miraculous 
cures in religion. 

Many intelligent blind individuals eventu- 
ally adopt realistic attitudes towards medi- 
cine. However, if acceptance of blindness 
does not accompany a realistic attitude
towards medicine, these individuals may dis-
cover new areas into which they project their 
magical beliefs. Having failed to regain 
vision, they may shift their magical beliefs 
from ophthalmology to religion or to some 
other frame of reference through which they 
hope to experience a miraculous cure. 

A comparison of the F Scale with both 
the Medical and Religious Scales revealed 
comparatively high correlations. These cor- 
relations suggest the presence of a common 
psychological factor. The three scales un- 
doubtedly measure personal reactions to au- 
thority. Medicine and religion could be con- 
sidered as specific domains of authority; 
whereas, items on the F Scale pertain to au- 
thority in a general sense. Thus, attitudes 
towards authority in the broad sense may be 
identical with attitudes towards authority in 
specific areas. 


SUMMARY 


Fifty-two blind subjects were rated on the 
Social Adjustment Scale and then tested for 
ego strength, manifest anxiety, attitudes 
towards blindness, attitudes towards the 
efficacy of medicine and religion to restore 
health, and degrees of antidemocratic per- 
sonality. Each blind subject was interviewed 
so that additional data could be obtained 
about attitudes towards a miraculous cure of 
blindness. Twenty-five physically normal 
subjects were matched with 25 blind subjects 
who scored highest on the Social Adjustment 
Scale. The control subjects followed the same 
procedure as that of the blind subjects but 
were not rated for social adaptability. 

The results indicated that social adapta- 
tion to blindness was related to high ego 
strength, low manifest anxiety, and a positive 
attitude towards blindness. Blind subjects 
who were socially maladjusted placed un- 
usual confidence in the efficacy of medicine 
and religion to restore health and strongly
believed they would regain vision through a
miraculous cure. Subjects who were poorly 
adjusted to the environment were charac- 
terized as more antidemocratic in their per-
sonalities than blind subjects who had made a
good social adjustment to their handicap. 

No significant differences were obtained 
between socially adapted blind and physi- 
cally normal subjects. 


REFERENCES 


Adorno, T. W., Frenkel-Brunswick, Else, Levinson, D. J., & Sanford, R. N. The authoritarian personality. New York: Harper, 1950.

Barker, R. G. The social psychology of physical disability. J. soc. Issues, 1948, 4(4), 28-38.

Barker, R. G., Wright, B. A., Meyerson, L., & Gonick, M. R. Adjustment to physical handicap and illness: A survey of the social psychology of physique and disability. (Rev. ed.) New York: Social Science Research Council, 1953.

Barron, F. An ego-strength scale which predicts response to psychotherapy. J. consult. Psychol., 1953, 17, 327-333.

Bauman, M. K. Adjustment to blindness. Harrisburg, Pa.: State Council for the Blind, 1954.

Burlingham, D. Psychic problems of the blind. Amer. Imago, 1941, 2, 43-85.

Deutsch, F. The sense of reality in persons born blind. J. Psychol., 1940, 10, 121-140.

Fitting, E. A. Evaluation of adjustment to blindness. New York: American Foundation for the Blind, 1953.

Lewin, K. Field theory in social science. New York: Harper, 1951.

Meyerson, L. The visually handicapped. Rev. educ. Res., 1953, 23, 476-491.

Piaget, J. The construction of reality in the child. New York: Basic Books, 1953.

Raskin, N. J., & Weller, M. F. Current research in work for the blind. New York: American Foundation for the Blind, 1953.

Taylor, J. A. A personality scale of manifest anxiety. J. abnorm. soc. Psychol., 1953, 48, 285-290.

Thurstone, L. L., & Chave, E. J. The measurement of attitude. (2nd ed.) Chicago: Univer. Chicago Press, 1951.

Wechsler, D. The measurement of adult intelligence. Baltimore: Williams & Wilkins, 1944.


(Received March 17, 1960)








Journal of Consulting Psychology 
1961, Vol. 25, No. 2, 160-164 


SPEECH DISTURBANCE AND JUDGED ANXIETY 


DONALD S. BOOMER AND D. WELLS GOODRICH 1
National Institute of Mental Health 


In the comparatively brief history of sys- 
tematic research in psychotherapy, verbal 
measures have been given a great deal of 
attention. Much of this work has taken the 
form of developing systems of categories 
for classifying manifest content. A survey 
of this literature by Auld and Murray (1955) 
lists 99 content analysis studies, all of them 
dealing in one way or another with what 
people say. 

Although it is generally recognized that 
how people talk is also a significant aspect 
of their speaking behavior, comparatively 
little attention has been devoted to the 
formal, structural, and expressive aspects of 
speech. The major variables which have been 
systematically studied include: rate of speech 
(Goldman-Eisler, 1956; Henze, 1953; Lass- 
well, 1935), the interaction chronograph 
variables developed by Chapple (1939), 
microlinguistics (McQuown, 1957; Pittenger 
& Smith, 1957), and speech disturbances 
(Dibner, 1956; Mahl, 1956). 

The present paper is based on a replica- 
tion of the Mahl study cited above. Since 
this study will only be summarized here, 
interested readers should consult the original 
report for details. In brief, Mahl presented 
an empirically derived set of categories of 
disturbance which occur frequently in speech. 
The categories are: (a) “ah,” (b) sentence 
correction, (c) sentence incompletion, (d) 
repetition, (e) stutter, (f) incoherent sound, 
(g) tongue-slip, (h) omission of a word or
part of a word. In a subsequent report Mahl 
(1959) excluded “ah” from this list. Our 
own experience leads us to concur in this 


1We gratefully acknowledge the contributions 
made to this research by several colleagues: Janet 
Barclay, principal research assistant; Allen T. 
Dittmann, Leonard Duhl, and Joseph Handlon, 
clinical judges; Sam Greenhouse, statistical con- 
sultant; and Charles Odell, research assistant. 


exclusion; hence the speech disturbance data 
in this paper will refer only to categories 
b-h, which Mahl terms the “non-ah” disturb- 
ances. 

These speech disturbances (hereinafter 
referred to as SDs) are scored by listening 
to recordings of speech and marking each 
disturbance on a typewritten transcript at 
the precise place in the text where it occurs. 
The speech disturbance ratio (SDR) for 
any selected passage can then be calculated 
by dividing the number of speech disturb- 
ances by the number of words spoken. 
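
The SDR computation described above reduces to a simple ratio once the disturbances have been tallied. The following is a minimal sketch under stated assumptions: the category labels, the dictionary layout, and the function name are illustrative choices and do not come from Mahl's report or from the present study. Only the "non-ah" categories (b-h) are counted, in line with the exclusion adopted in this paper.

```python
from typing import Mapping

# Categories b-h; "ah" is deliberately left out of the ratio.
NON_AH_CATEGORIES = (
    "sentence correction",
    "sentence incompletion",
    "repetition",
    "stutter",
    "incoherent sound",
    "tongue-slip",
    "omission",
)


def speech_disturbance_ratio(counts: Mapping[str, int], n_words: int) -> float:
    """Number of non-ah speech disturbances divided by words spoken."""
    if n_words <= 0:
        raise ValueError("n_words must be positive")
    n_disturbances = sum(counts.get(cat, 0) for cat in NON_AH_CATEGORIES)
    return n_disturbances / n_words


# Hypothetical tallies for one passage of 200 words.
passage_counts = {"ah": 5, "repetition": 3, "stutter": 1}
print(speech_disturbance_ratio(passage_counts, 200))
```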

Mahl hypothesized that SDR varies di- 
rectly with fluctuations in the speaker’s 
anxiety level. As a test of this hypothesis he 
compared SDRs with his clinical judgments 
of anxiety in six therapeutic interviews with 
a patient he had treated 2½ years previously.
Contamination was avoided by having all 
SDs edited out of the typewritten tran- 
scripts from which the clinical judgments 
were made. His central finding was that the 
mean SDR was significantly higher for those 
phases of the interviews he judged “high 
anxious” than for those phases he judged 
“low anxious.” 

At the time Mahl’s study was published 
the present authors were trying to define and 
measure some aspects of patients’ speech 
which might reflect the immediate effects of 
specific therapeutic interventions. The SDR 
appeared to be a potentially useful research 
instrument for this purpose, provided that 
certain issues could be clarified. These issues 
are embodied in the following hypotheses 
which were the basis of this replication and 
extension of the Mahl study: 


H1. High anxiety interview phases, as
judged by the therapist, have a greater mean 
SDR than low anxiety interview phases. 




This, of course, was Mahl’s fundamental
finding. Our aim was straightforward replica-
tion: could this effect be successfully demon-
strated again using different patients and
therapist-judges?

H2. High anxiety interview phases, as
judged by persons other than the therapist, 
have a greater mean SDR than low anxiety 
interview phases. 


This hypothesis formalizes the question as 
to whether the judgment of anxiety requires 
the intimate, detailed, firsthand knowledge 
of the patient possessed only by the therapist. 
This question cannot be answered from 
Mahl’s results since he was both therapist 
and sole judge. 


H3. Independent judgments by different
judges demonstrate substantial agreement as 
to the identification of phases of high and 
low anxiety. 


This is a test of the interjudge reliability 
of the criterion judgment of anxiety. It also 
tests one of the links in the logical chain 
which relates the SDR to anxiety, since the 
statement that A is a measure of B requires 
that values for both A and B can be deter- 
mined with acceptable reliability and that 
they covary systematically. Mahl’s study 
demonstrates only that the SDR is a reliable 
measure and that it covaries with his assess-
ment of anxiety. The present study has
sought evidence concerning the missing term, 
that is, the reliability with which fluctua- 
tions of anxiety within an interview can be 
judged. 


METHOD 


Subjects. The subjects for this study were two
outpatients in a psychotherapy research project.
Mrs. Alpha had been seen for approximately 150
hours by one of the authors. Mrs. Beta had been
seen for approximately 50 hours by the other
author. All of their interviews had been tape
recorded.


Procedure. For each patient four interviews were 
selected randomly and transcribed verbatim for SD 
scoring. Two research assistants were trained to 
score the SDs and did all of the scoring for this 
study. The reliability of their scoring approximated 
the .94 figure reported by Mahl. After scoring, the 
transcripts were retyped with all SDs edited out.
The judgments of anxiety were made from these
edited transcripts.


Anxiety Judgments. These judgments were made
according to the procedure described by Mahl in
his report and in personal discussions with the
authors. The preparation for the judging task began
with an extended clinical review of the case. The
judges then listened to and discussed a selection of
the recorded interviews, including those interviews
immediately preceding and following the test inter-
views, but excluding, of course, the test interviews
themselves.

After this preparation, which required about 15
hours, the judges independently made the clinical
judgments of anxiety on the test interviews. The
task was to divide each interview into a series of
phases, representing periods of relatively high or
relatively low anxiety for the patient. In describing
this judging task Mahl (1956) says:


During therapeutic sessions and while studying 
recordings it often appears that interviews are di- 
visible into “natural” segments or phases, each 
of which could be assigned to a single theme of 
content or interaction, and that the patient be-
comes anxious and conflictful in some, but
becomes less anxious in others (p. 6).


The judging procedure, as described by Mahl, 
requires what can best be termed “immersion” in 
the interview material. The judges read and reread 
the typescripts at odd moments over a period of 
several weeks, making notes, revising, and trying 
out tentative approaches until the emerging phases 
seemed to stabilize. Finally these phases were marked 
off in the transcript and labeled “high” or “low” 
anxiety. The clinical preparation and judging were 
carried through to completion for Mrs. Alpha 
before beginning with Mrs. Beta. 

The judges were the authors, who judged both 
cases, and three additional volunteers, two of whom 
judged Mrs. Beta’s interviews and one of whom 
judged Mrs. Alpha’s. All five judges were trained 
psychotherapists with 5 to 10 years’ experience.


RESULTS


The data to be presented in this section 
will be organized in terms of the three hy- 
potheses listed in the introduction. 

H1. Only the phase judgments of the
authors on their respective patients were 
relevant to this hypothesis. The data analysis 
was directly adopted from Mahl. For each 
phase of each interview a SDR was com- 
puted, and mean SDRs calculated for the 
high anxiety and low anxiety phases for each 
patient. 

2For individual interviews the number of high
and low phases were not always equal. In order to
avoid the bias of the “hour effect” described by
Mahl (1956, p. 7) the excess number of high or low
phases were randomly discarded from those inter-
views in which the numbers were not equal.


The results of this test are equivocal. In 
the case of Mrs. Beta the high anxiety phases 
established by her therapist had a signifi- 
cantly higher mean SDR than did the low 
anxiety phases (t = 2.29; p < .05). In the
case of Mrs. Alpha there was no significant 
difference between these means. 

H2. In the case of Mrs. Alpha, it will be
recalled, two judges besides her therapist
judged the interviews; for Mrs. Beta there
were three additional judges. The data
analysis was the same as that for H1. This
hypothesis was not supported. None of the 
five sets of judgments by the nontherapists 
showed the predicted SDR discrepancy. 

H3. It must be stated at the outset that
this hypothesis could not be directly tested 
without altering the design such that it would 
no longer constitute a replication. The in- 
herent statistical obstacle is that the free 
judgment instructions require the judges to 
establish their own units, i.e., motivational 
phases, which they are to label high or low 
anxiety. Since judges can and do differ with 
one another regarding the number and limits 
of the phases they discern in a given inter- 
view, there are no common units on which 
to base a quantitative statement about their 
agreement. 

Furthermore, no arbitrary division of the 
interview into comparison units, minutes, 
say, would serve the purpose. Such minute- 
units could not defensibly be regarded as 
independent events for statistical analysis 
because the original judgments had been 
made on groups of from 2 to 20 consecutive 
minutes. 

Since a direct test was impossible, a 
method was devised to permit a statement 
by indirection about interjudge agreement. 
The reasoning was as follows: (a) In at- 
tempting to specify interview phases of high 
and low anxiety, judges agree on some pas-
sages and disagree on others. (b) If the
passages of agreement are not random, but 
do, in fact, reflect the judges’ consensus about 
the level of anxiety, Mahl’s hypothesis would 
predict systematic SDR differences between 
consensually high and low anxiety passages. 
(c) If the high and low consensual passages
are not different with regard to SDR one of
two things is true: interjudge agreement,
where it exists, does not represent agree-
ment-about-anxiety, or else the SDR does
not measure anxiety.

The statistical procedure was as follows: 
Transcripts of the eight test interviews were 
divided into minutes. These minute-units 
were then consecutively tabulated in terms 
of their labeling as high or low anxiety by 
all judges. Consensual “phases” were con- 
structed by selecting all sequences of 2 or 
more consecutive minutes in which all judges, 
or all judges but one agreed. It was possible 
thus to construct eight high and eight low 
consensual phases for each patient. These 
constructed phases were comparable in length 
to the phases which had emerged from the 
individual judgments since they averaged 6 
minutes with a range of from 2 to 19 min- 
utes. In the aggregate, these phases were a 
large sample, representing more than 50% 
of the interview material. 
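
The construction of consensual phases can be sketched as a small routine. This is an illustrative reading of the procedure described above, not the authors' actual tabulation: it assumes each minute carries one "high" or "low" label per judge, treats a minute as consensual when all judges or all but one give the same label (and that label is a strict majority), and keeps runs of 2 or more consecutive consensual minutes with the same label. All names and the sample data are hypothetical.

```python
from itertools import groupby
from typing import List, Optional, Sequence, Tuple


def consensus_label(labels: Sequence[str], max_dissent: int = 1) -> Optional[str]:
    """Return 'high' or 'low' if all judges, or all but max_dissent, agree."""
    labels = list(labels)
    n = len(labels)
    for level in ("high", "low"):
        agree = labels.count(level)
        # Require a strict majority so that two judges who split do not count.
        if agree >= n - max_dissent and agree > n / 2:
            return level
    return None


def consensual_phases(
    minute_labels: Sequence[Sequence[str]], min_len: int = 2
) -> List[Tuple[str, int, int]]:
    """Return (level, start_minute, end_minute_exclusive) for each phase."""
    per_minute = [consensus_label(m) for m in minute_labels]
    phases = []
    start = 0
    for level, run in groupby(per_minute):
        length = len(list(run))
        if level is not None and length >= min_len:
            phases.append((level, start, start + length))
        start += length
    return phases


# Hypothetical labels from four judges for seven consecutive minutes.
example = [
    ("high", "high", "high", "high"),
    ("high", "high", "high", "low"),
    ("high", "high", "low", "low"),   # two against two: no consensus
    ("low", "low", "low", "high"),
    ("low", "low", "low", "low"),
    ("low", "low", "low", "low"),
    ("high", "low", "high", "high"),  # consensual, but a single minute
]
print(consensual_phases(example))
```

Mean SDRs for the resulting high and low phases could then be compared in the same way as for the individual judges' phases.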

Analysis revealed that for neither patient 
did the high anxiety and low anxiety con- 
sensual phases differ with regard to mean 
SDR. In terms of the argument presented 
above, these data furnish no support for the 
hypothesis that consensus among judges 
represents consensus about anxiety, if anxiety 
is presumed to be accompanied by high 
SDRs. Further elaboration of this point will 
be reserved for the discussion section. 




DISCUSSION 


The results of this study can be sum-
marized as follows: for one of the two pa-
tients Mahl’s finding was successfully repli-
cated, i.e., her therapist’s judgments of
periods of high and low anxiety were posi- 
tively and significantly related to the SDR 
during these periods. Anxiety judgments on 
the same interview material made by three 
additional judges failed, however, to show 
any significant relationship to the SDR. 

For a second patient her therapist’s anx- 
iety judgments and those of two additional 
judges uniformly failed to show any rela- 
tionship to SDR. Finally, those sections of 
both patients’ interviews identified by the 
majority of the judges as “more anxious” 
showed no higher SDR than those sections 
consensually labeled “less anxious.” 

These findings do not lend themselves to
any clear-cut conclusions about the SDR.
The following issues, however, have been
clarified:

1. The Mahl hypothesis was supported in 
one of the cases, suggesting that his results 
have a degree of generalizability which war- 
rants further research. 

2. In the other case the findings were nega- 
tive. This indicates that the SDR cannot be 
uncritically accepted as a universal measure 
of intercurrent anxiety in psychotherapy in- 
terviews. Further investigation is necessary in 
order to specify the conditions under which 
the SDR can be meaningfully employed. 

3. The hypothesized relationship between 
SDR and anxiety judged from transcripts 
holds, when it does hold, only for the judg- 
ments made by the therapist who treated 
the patient. This was true of Mahl’s patient 
and of Mrs. Beta in the present study. Thus 
Mahl’s criterion, the therapist’s judgment,
remains essentially private and unrepeatable. 


The unfortunate consequences of this state 
of affairs can be seen in the present study. 
In seeking to validate a new measure like 
SDR against a criterion of unknown reli- 
ability only positive results are informative. 
Negative findings leave the issues con- 
founded. The failure of replication with Mrs. 
Alpha, for example, may be interpreted in 
several equally likely ways. It is possible 
that her anxiety fluctuations are not reflected 
in her SDR, or that her anxiety fluctuations 
are not reflected in her written transcript, 
or that these particular judges are not
sufficiently sensitive to discern her anxiety
fluctuations.

These, of course, are not all of the possible
interpretations, but they suffice to make the
familiar point that one of the ends of a
measurement hypothesis requires empirical
anchoring. Mrs. Alpha’s data, being negative,
show that the SDR hypothesis needs refinement;
being equivocal, they provide no
crucial evidence for any specific refinement.


Further Research 


The foregoing discussion indicated the 
need for more specification regarding the 
properties and the limitations of the SDR. 
Also illustrated were the shortcomings of the
research use of clinical judgment of anxiety
fluctuations in providing the necessary cru- 
cial evidence. No simple solution can be 
suggested for this problem. No highly reli- 
able measure of anxiety can be offered as a 
substitute for clinical judgment, since no 
such measure exists. At our present stage of 
knowledge the phenomena associated with 
anxiety can be accounted for only by a 
complex description which involves observa- 
tions and inferences about physiological 
activity, private experience, and observable 
behavior, both verbal and nonverbal. The 
complex interrelationships among this loose 
network of phenomena are not now suffi- 
ciently explicit to warrant the belief that 
anxiety can be scaled along any single
dimension or gauged at any one level of
functioning.

In the face of such theoretical complexity
the preferred research strategy might be to
regard the SDR as a promising measure of
certain aspects of anxiety in certain classes
of people under certain conditions. Experimental
work, foregoing the attempt to
demonstrate that the SDR measures anxiety,
could focus on some limited prior questions
concerning the psychological properties of
the measure and the manner and conditions
of its covariance with other reproducible and
reliable measures which may also be presumed
to reflect some aspects of anxiety.


Some research along these lines has al- 
ready begun. Panek and Martin (1959) 
building on Mahl’s work, have demonstrated 
with a group of psychotherapy patients that 
GSR dips are preceded by rising SDR gradi- 
ents and followed by declining gradients. 
Dittmann⁴ is studying some temporal relationships
between SDR and certain body
movements. The present authors are investi- 
gating possible relationships between SDR 
and rate of speech. The strategic advantage 
of this part-problem approach to anxiety 
measurement is that the use of reproducible
methods and measures makes it possible
ultimately to integrate these and other similar
studies experimentally and conceptually.

3 Dibner (1958) intercorrelated five measures of
anxiety: skin conductance, patients’ self-ratings,
clinicians’ ratings, and two separate measures of
speech disturbance. Of the 10 correlations thus
generated, 8 were not significantly different from
zero.

4 Personal communication, 1959.


SUMMARY 


This research was an attempt to repeat a 
pioneering study by Mahl (1956), in which 
he demonstrated that the incidence of certain 
disturbances of speech increased during por- 
tions of psychotherapy interviews judged to 
be anxious, and decreased during portions 
judged less anxious. 

The results of the present test were in- 
conclusive. The anxiety judgments for one 
patient made by her therapist supported 
Mahl’s finding, but in a second case the 
judgments made by the therapist failed of 
replication. The judgments made by five 
additional judges who were not the patients’ 
therapists uniformly failed to show the hy- 
pothesized relationship to the speech dis- 
turbance measures. Reconciliation of these 
results must await further research. 

REFERENCES 
Auld, F., & Murray, E. J. Content-analysis studies
of psychotherapy. Psychol. Bull., 1955, 52, 377-395.

Chapple, E. D. Quantitative analysis of the interaction
of individuals. Proc. Nat. Acad. Sci., Wash.,
1939, 25, 58-67.

Dibner, A. S. Cue counting: A measure of anxiety
in interviews. J. consult. Psychol., 1956, 20,
475-478.

Dibner, A. S. Ambiguity and anxiety. J. abnorm.
soc. Psychol., 1958, 56, 165-174.

Goldman-Eisler, Frieda. The determinants of the
rate of speech output and their mutual relations.
J. psychosom. Res., 1956, 1, 137-143.

Henze, R. Experimentelle Untersuchungen zur Phänomenologie
der Sprachgeschwindigkeit. Z. exp.
angew. Psychol., 1953, 1, 214-243.

Lasswell, H. D. Verbal references and physiological
changes during the psychiatric interview: A preliminary
communication. Psychoanal. Rev., 1935,
22, 1-24.

McQuown, N. A. Linguistic transcription and
specification of psychiatric interview material.
Psychiatry, 1957, 20, 79-86.

Mahl, G. F. Disturbances and silences in the patient’s
speech in psychotherapy. J. abnorm. soc.
Psychol., 1956, 53, 1-15.

Mahl, G. F. Measuring the patient’s anxiety during
interviews from “expressive” aspects of his speech.
Trans. NY Acad. Sci., Ser. 2, 1959, 21, 249-257.

Panek, D. M., & Martin, B. The relationship between
GSR and speech disturbances in psychotherapy.
J. abnorm. soc. Psychol., 1959, 58, 402-405.

Pittenger, R. E., & Smith, H. L. A basis for
some contributions of linguistics to psychiatry.
Psychiatry, 1957, 20, 61-68.

(Received March 24, 1960) 





Journal of Consulting Psychology
1961, Vol. 25, No. 2, 165-170


NORMAL, HYPNOTICALLY INDUCED, AND FEIGNED 
ANXIETY AS REFLECTED IN AND DETECTED 
BY THE MMPI¹


ALBERT A. BRANCA AND EDWARD E. PODOLNICK


University of Delaware 


The use of hypnosis as a technique for 
the production of signs of disorder in normal 
people has been attempted. Luria (1932), 
and Huston, Shakow, and Erickson (1934) 
showed that word association techniques to- 
gether with certain motor responses were 
successful in revealing the presence of emotion-arousing
conflicts that had been suggested
in hypnosis. Fisher and Marrow (1934)
reported significant differences in reaction 
times obtained in hypnotically induced 
“moods” of elation and depression. Sweetland 
(1948) suggested certain psychiatric syn- 
dromes to normal subjects who had been 
hypnotized. Comparison of MMPI profiles 
obtained when these syndromes were suggested
indicated that it was possible to produce
“laboratory neuroses” by hypnosis.
Grosz and Levitt (1959) suggested anxiety to 
12 hypnotized medical and nursing students. 
They reported increased scores on the Taylor 
Manifest Anxiety Scale and diminished scores 
on the Barron Ego Strength scale. They also 
reported that scores on the two tests taken 
during the waking state did not differ from 
scores obtained during hypnotic states when 
anxiety was not suggested. 

Studies also show that the MMPI validity 
scales can identify dissemblers. Gough 
(1947) showed that the MMPI was able to 
identify “fakers” even when they were
psychiatrists, clinical psychologists, and 
social workers who were familiar with the 
diagnostic signs of behavior disorders as well 
as the MMPI. Other investigators (Cofer, 
Chance, & Judson, 1949; Hunt, 1948) also 
indicate that the MMPI, through separate
or combined use of its validity scores, is
capable of differentiating between dissemblers
and other groups.

1 This research was supported by a University of
Delaware Faculty Summer Research Grant.

2 Now at Bucknell University.

In 1952 Welsh added an Anxiety (A) 
scale to the MMPI. This development has 
made it possible to observe the effects of 
suggesting this simpler and more general 
symptom of disorder. 

The specific hypotheses of this experiment 
are: 


1. There will be a significant increase in 
the A scale of the MMPI between scores 
obtained under normal conditions and under 
conditions of hypnotically induced anxiety. 

2. The validity scales of the MMPI will 
differentiate between profiles obtained under 
conditions of dissembling and profiles ob- 
tained under conditions of both normal and 
hypnotically induced anxiety. 


METHOD 
Subjects 


Ten students, of whom eight were female, were 
used as subjects in this experiment. The normal 
records were obtained from students who had taken 
the MMPI as part of a classroom demonstration. 
At the time the first profiles were obtained, the 
students were not aware that they might be called 
upon to participate in an experiment. Students 
from other freshman and sophomore courses volun- 
teered to participate when they had heard about 
the study. Two of these students were used as ex- 
perimental subjects. These two were given to believe 
that the MMPI was being used as a screening 
device and not a part of the experiment proper. 

Experimental candidates were selected on the basis 
of: (a) normal MMPI profiles and anxiety scores; 
(b) absence of a history of treatment for mental
disorder; (c) absence of a history of epilepsy, or 
convulsions, or neurological disease of any type; 
(d) a willingness to participate in the experiment. 
Actual subjects were selected from this larger group 
on the basis of hypnotizability. The criterion for
depth of trance was the elicitation of positive auditory
and visual hallucinations. Out of a total of 50
experimental candidates, 10 met this criterion, 2 
males and 8 females. This percentage is consistent 
with others also reporting approximately 20% suc- 
cess in obtaining a deep trance (Dorcus, 1956). 
A disproportionately large number of females volun- 
teered to participate in the experiment. 


Procedure 


Each hypnotic session was held in a room with 
a one-way observation screen and an intercom 
system. In this way one experimenter was able to 
observe each session while the other performed the 
hypnosis. Each candidate was made aware of this 
observation. 

The first phase of the experiment consisted of 
training sessions wherein the experimental candidates 
were trained to achieve the trance state. When a 
depth of trance was reached in which positive 
auditory and visual hallucinations were produced, 
the candidate met the criterion for inclusion as 
an experimental subject and an anxiety state was 
suggested. The instructions for producing anxiety 
were obtained from definitions and descriptions of 
anxiety by various authors (Conklin, 1936; Heyns, 
1958; Lehner & Kube, 1955; May, 1950; Shaffer & 
Shoben, 1956; Warren, 1934). These instructions 
were as follows: 


You are beginning to feel very uneasy and 
anxious. You don’t know why, but this uneasy 
feeling is making you nervous, irritable, and 
frightened. You feel as if something dreadful is 
about to happen but you don’t know what. This 
feeling of dread is mingled with a curious feeling 
of hope that is very unpleasant. You are be- 
coming more and more apprehensive. You are in 
a state of anxious expectation and self-doubt. You 
feel now as if you are threatened and it frightens 
you. You feel as if you are about to lose some- 
thing important to you, or be hurt. This anxiety 
is becoming stronger and stronger. Now you feel
as if something is wrong, as though you had
neglected to do something very important, but
you can’t recall what it is. You feel, though,
that whatever it is, it is making you feel on edge
and uneasy. It is making you feel blue, melancholy,
unhappy, and excited in an unpleasant
way. You feel frightened, but you don’t know
what it is you are frightened about. This is
certainly an unpleasant form of excitement. You
are now very apprehensive and anxious.


After the anxiety instructions were read, the
MMPI was readministered. The subjects took between
70 and 90 minutes to complete the MMPI.
In order to maintain the trance state for that period,
instructions and suggestions reinforcing the trance
state were given when the subject had reached the
halfway point in the test. At that time the anxiety
instructions were also reread to each subject. The
subjects were aroused from the trance state after
suggestions counteracting the anxiety were made.
These instructions, given twice, were as follows:


You are beginning to feel less apprehensive and
anxious. The unpleasant form of excitement caused
by the fact that you were frightened is leaving
you. You are beginning to feel happier, more
alert, and relaxed. You no longer feel on edge or
uneasy, and you are experiencing a feeling of
well-being. You are now confident and at ease.
You feel happy and at peace with the world.
You are experiencing a soothing calmness and
you feel warm, comfortable, relaxed, and alert.
You don’t feel irritable, nervous, or frightened
any more. You are no longer apprehensive and no
longer feel self-doubt. You don’t feel as if you
are frightened or are about to be hurt. You don’t
feel as if you’re about to lose something but
don’t know what. You are now very relaxed. You
feel as if all of your troubles and problems are
leaving you. You feel as if all of your fears are
gone and this gives you a feeling of ease and
comfort. You are happy and relaxed and normal.


At the hypnotic session subjects were instructed
to remember all events that occurred during the
trance state.

In the final phase of the experiment, which took
place approximately a week after the first phase,
each subject was told that he was to make believe
that he was anxious and that he was to “fake”
anxiety while taking the MMPI. The same description
of the anxiety state was read to him again
with the statements “make believe that” or “pretend
that” prefixing each sentence. He was further instructed
to mark the test as though he were trying


TABLE 1 

MMPI T SCORE MEANS AND STANDARD DEVIATIONS
UNDER NORMAL CONDITIONS, UNDER CONDITIONS OF
HYPNOTICALLY INDUCED ANXIETY, AND UNDER
CONDITIONS OF DISSEMBLING


HIA D 
dition Conditior 


VU SD 


42 
4.81 
11.87 
8.99 
15.61 
17.31 
7.09 
14.49 
10.24 
21.49 
14.91 
19.25 
11.44 
13.12 
9.36 
8.62 


i 
Ons Nm ws 


10.90 


816 


vs 


an 
mw 


> 
i 


6.13 


7.95 ij 
6.48 5.3 8&7 
10.30 3.7 5 
8 67 56.5 4? 
$83 54.5 70 


7.07 46.7 5.54 


Duy 


> 
-_ © 


Note.—N 


3ased_on rav 





Anxiety in the MMPI 


TABLE 2 


ANALYSES OF SCORE 


N and HIA 


yd 


3.31 
1.88 
29.39** 
jae 
11.80** 
16.48** 
12.08** 
a” 
1.66 
16 45** 
Pt 17.93** 
Se 29.44** 
Ma 2.51 
Si 9.80** 
27.34** 


R 0.27 


= 


Nwre oO 


—m Ww © Ww 


“I bv 


Note Minus signs ind 

* Values of F obtained by analysis 
Based on raw scores
* Significant at the .05 level

** Significant at the .01 level


to create the test profile of a person suffering great
anxiety.

In order to provide an additional subjective check
of the subjects’ emotional states during the experiment,
an anxiety rating scale was constructed according
to the Likert technique (Edwards, 1957).
It consisted of 40 questions about the way the
subject felt at the time of responding to the scale.
It included items such as: “I am at ease,” “I feel
tense without any good reason,” “My morale is
low,” “I am restless and irritable now.” Each of its
40 items had been shown to discriminate between
high anxious and low anxious groups. It has a
split-half reliability coefficient of .97. This scale was
administered with the MMPI as part of a
classroom demonstration. It was also given under
conditions of hypnotically induced anxiety.


RESULTS 


The data of this experiment consisted of 
the MMPI profiles obtained under normal 
conditions (N), conditions of hypnotically 
induced anxiety (HIA), and conditions of 
dissembling (D), as well as scores on the 
anxiety rating questionnaire obtained under 
the first two conditions. The means and the
standard deviations of the T scores for each
scale are listed in Table 1. Both the ? and
F scales are given in raw scores. The scores
on ? were well enough below the necessary


UNDER NORMAL, HyPNOTICALLY 
DISSEMBLING 


condition | 
e under 


INDUCED ANXIETY, AND 


CONDITIONS 


HIA and D 


MD 


N and D 


—.1 
—3.9 
19.0 
—§8.4 
20.5 
23.2 
16. 


— 


) 
— 
Nin = t 


26 


_ w bt 
aw 
wens OUI Oo © 


_ 
Nm oO 


listed were lower than those 


the three conditions 


of the first 
#% the experiment 


30 that conversion to T scores would necessitate
each raw score having the same T,
i.e., a T of 50. The scores on F were so high
under conditions of dissembling that conversion
to T scores tended to hide differences
between this condition and the other
two. Because of the small N (N = 10), N-1
was used in computing the standard deviations
(Edwards, 1950).
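
The use of N-1 in the denominator can be illustrated with a minimal sketch. The function name `sample_sd` and the ten scores below are hypothetical and are not data from this study; the sketch only shows the arithmetic of a standard deviation computed with N-1.

```python
import math
import statistics


def sample_sd(scores):
    """Standard deviation with N - 1 (not N) in the denominator."""
    n = len(scores)
    mean = sum(scores) / n
    sum_sq = sum((x - mean) ** 2 for x in scores)
    return math.sqrt(sum_sq / (n - 1))


# Hypothetical T scores for N = 10 subjects (illustrative only).
t_scores = [48, 52, 50, 55, 47, 53, 49, 51, 54, 46]

print(sample_sd(t_scores))            # N - 1 in the denominator
print(statistics.pstdev(t_scores))    # N in the denominator, for comparison
```

With so small a group the N-1 value is noticeably larger than the value obtained by dividing by N, which is the reason the choice of denominator matters here.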

An analysis of variance was performed for
each scale under the three conditions. A comparison
was then made between the T scores
on each scale obtained under N and HIA
conditions, N and D conditions, and HIA
and D conditions. A t test was employed
for this purpose. Because the scores obtained
under these three conditions were not random
with respect to each other, a t comparing
the differences between correlated means was
computed using the differences between the
scores (McNemar, 1949). The mean differences
and the t’s for each scale for all combinations
of the three conditions are listed in
Table 2. Although all values of t were reported,
they were marked as significant, in
the conventional manner, only for those
scales where significantly large Fs were
obtained.
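
The t for correlated means described above can be sketched as follows. This is an illustrative computation from difference scores, assuming the usual formula (mean difference divided by its standard error, with N-1 degrees of freedom); the function name `paired_t` and the scores are hypothetical and are not the study's data.

```python
import math


def paired_t(first, second):
    """t for correlated means, computed from the difference scores.

    Returns (mean difference, t, degrees of freedom).
    """
    diffs = [b - a for a, b in zip(first, second)]
    n = len(diffs)
    mean_d = sum(diffs) / n
    sum_sq = sum((d - mean_d) ** 2 for d in diffs)
    sd_d = math.sqrt(sum_sq / (n - 1))   # N - 1 in the denominator
    se_d = sd_d / math.sqrt(n)           # standard error of the mean difference
    return mean_d, mean_d / se_d, n - 1


# Hypothetical scores for five subjects under two conditions (illustrative only).
normal = [50, 48, 52, 47, 51]
induced = [51, 50, 55, 51, 56]

mean_diff, t_value, df = paired_t(normal, induced)
print(mean_diff, t_value, df)
```

A positive mean difference here means the second condition listed gave the higher scores. Marking a t as significant only where the corresponding F was significant, as the authors did, would be an additional step applied to the output of such a function.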






In comparing the scores between the N 
and HIA conditions, the F, K, Pa, Sc, Si, 
and A scales showed significant changes. In 
comparing the scores between the HIA and 
D conditions, only the ?, L, Mf, Ma, and R
scales showed insignificant changes. Likewise, 
the differences between the scores for N and 
D conditions were insignificant for the L, 
Ma, ?, Mf, and R scales, being significant 
for all others. 

A t was also computed for the anxiety 
rating questionnaire. The scores on this 
questionnaire, given twice (N and HIA 
conditions) showed a mean change which 
is significant at the .01 level. 


DISCUSSION 
Observation of Subjects 


During the HIA session, there were indica- 
tions of stress on the part of the subjects. 
When asked how they felt, they made com- 
ments such as: “I don’t feel good,” “I feel 
like I want to get out of here,” “I feel un- 
happy,” and “I feel as if something were 
wrong.” In addition to these comments, the 
subjects showed signs that the experimenters 
interpreted as discomfort and distress. These 
signs were: furrowing of the brow, clenching 
of hands, frowning, tenseness, biting of lips, 
sighing deeply. One subject, a female, burst 
out crying while answering the MMPI items. 
The experimenter stopped the test and, see- 
ing that she could not continue because of 
excessive crying, read the alleviating instruc- 
tions. She stopped crying as the instructions 
were being read and agreed to complete the 
test the following day under the same experi- 
mental conditions. 

In general, the subjects concentrated on 
the test and appeared to be making an effort 
to read and answer the items carefully. All 
subjects appeared relieved when the allevi- 
ating instructions were read. This relief was 
evidenced by smiling, relaxation of facial 
muscles, restrained laughter, and remarks 
such as: “I feel good now.” 


The A Scale 


Hypothesis 1 stated that there would be 
a significant increase in the A scale between 
scores obtained under normal conditions and 
under conditions of hypnotically induced
anxiety. This hypothesis was supported by
the data, the difference in the scores being 
significant at the .01 level. 

The A scale is made up of items which 
occur on several of the other scales. Cluster 
and factor analyses indicated that the scale 
was relatively homogeneous and seemed to 
be related to anxiety. The scale contains very 
few “obvious” items that deal directly with 
the word anxiety and its synonyms. However, 
the experimenters found seven such items 
that they considered obvious with respect to
the “anxiety” instructions the subjects received:
“I feel anxiety about something or
someone almost all of the time,” “I must
admit that I have at times been worried
beyond reason over something that really did
not matter,” “I worry quite a bit over possible
misfortunes,” “I brood a great deal,”
“I wish I could be as happy as others seem
to be,” “Most of the time I feel blue,” “I
very seldom have spells of the blues.” A
count was made of the number of times 
these seven items were chosen under the 
three conditions, N, HIA, and D. They were 
chosen a total of 13 times under N condi- 
tion, 34 times under HIA condition, and 63 
times under D condition. These differences 
were significant at the .01 level. A t was then
computed for the A scale with these seven 
items omitted to find if significant differ- 
ences would still be obtained without the 
obvious items. The removal of these items 
did not alter the degrees of significance ob- 
tained previously with the full scale. This 
reduces the likelihood that the elevation of 
the A scale was the simple result of a height- 
ened and conscious intent to comply with 
the suggestions of the experimenters. 


The Validity Scales 


Hypothesis 2 stated that the validity scales 
would differentiate between profiles obtained 
under conditions of dissembling and profiles 
obtained under conditions of both normal 
and hypnotically induced anxiety. This hy- 
pothesis was supported. Using the validity 
scales separately and the F minus K dis- 
simulation index with a cutoff point of plus 
five (Gough, 1950), all profiles from both the 
N and HIA conditions were in the normal 
range indicating valid profiles. In addition, 
the F scale alone identified all but one of the
profiles obtained under conditions of dissembling.
The F minus K index also did not
identify this one profile, but identified all 
others. 
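
The F minus K index lends itself to a short sketch. The source gives only "a cutoff point of plus five"; treating a profile as flagged when the raw-score difference exceeds five (rather than equals or exceeds it) is an assumption made here, and the function names and raw scores are hypothetical.

```python
def f_minus_k(f_raw, k_raw):
    """Dissimulation index: raw F score minus raw K score."""
    return f_raw - k_raw


def flags_dissembling(f_raw, k_raw, cutoff=5):
    """Flag a profile whose F minus K index is above the cutoff.

    Assumption: "above" means strictly greater than the cutoff.
    """
    return f_minus_k(f_raw, k_raw) > cutoff


# Hypothetical raw scores (illustrative only, not the study's data).
print(flags_dissembling(f_raw=20, k_raw=8))   # index 12
print(flags_dissembling(f_raw=6, k_raw=15))   # index -9
```

Because the index is a simple difference, a profile with a high F score can still escape detection if K is also high, which is one way a single dissembled profile could be missed by both the F scale cutoff and the index.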

The significant changes in the F and K 
scales obtained from Condition N to Condi- 
tion HIA do indicate that a change in 
test-taking attitude occurred in the latter 
state. The K scores were significantly lower 
in Condition HIA as compared with Condi- 
tion N, indicating that the subjects became 
more critical of themselves. In addition, the 
F scores were significantly higher in the HIA 
condition, indicating that while in this state,
the subjects answered more of these items in
the direction opposite to that in which the
normal standardization groups answered
them. The experimenters feel, however, that
this might be expected in that the HIA state 
represents a condition removed from the 
condition under which standardization was 
obtained. 

It is interesting to note that, although not 
significantly so, L scores were consistently
lower in the HIA and D states as compared 
to N. Perhaps the criticalness of the subjects, 
as evidenced by their lower K scores in the 
HIA and D conditions, also made them more 
“honest.” 

An hypnotic scale has not been derived 
from the MMPI item pool. Admittedly such 
a scale would be of little clinical value, but 
it would be of considerable experimental 
interest. A scale capable of differentiating 
between the waking and the hypnotic state 
would serve as an objective device for indi- 
cating achievement of the trance state. It 
might also provide the basis for an objective 
method of appraising depth of trance. 


The Diagnostic Scales 


All diagnostic scales except Mf, Ma, and R 
showed significant differences from N to D 
conditions. All diagnostic scales except Mf,
Ma, and R showed significant differences
from HIA to D conditions. Since the validity 
scales identified dissembling profiles in Con- 
dition D, but not in Condition HIA, these 
differences further support these results in 
indicating a real difference between the two 
conditions. Profiles obtained under conditions 
of hypnotically induced anxiety do not resemble
those obtained under conditions of
dissembling. The differences between profiles
obtained in these two conditions suggest that 
hypnosis is not a state of mere heightened 
cooperation. 

In going from Conditions N to HIA, it 
might be expected that only the A scale 
would be increased, since the instructions 
under hypnosis were directed to this effect.
However, the Pa, Sc, and Si scales were also 
significantly heightened. 

The rise in Pa may be considered as a 
direct result of the anxiety instructions. 
Looking post priori, it can be seen that sug- 
gestions such as “You are beginning to feel 
very uneasy and anxious,” and “You don’t 
know why, but this uneasy feeling is making 
you nervous, irritable, and frightened,” might 
easily be reflected in the items composing 
the Pa scale as these items were derived from 
patient samples “symptomatically to 
have ideas of reference, to feel that they 
were persecuted by individuals or groups.”
(Hathaway, 1956, pp. 109-110).

The heightened Sc and Si scales can be 
considered in the same way. Feelings of apprehension
and anxiety, as well as feelings
of being blue and melancholy, might be re- 
flected in items dealing with social intro- 
version, and schizophrenic symptoms have 
been described in much the same way. The 
D scale, however, was not significantly increased,
indicating that the part of the
instructions relating to feelings of depression
was not alone responsible for these changes.

The change in scores on the Likert-type
anxiety questionnaire from Conditions N to
HIA was significant at the .01 level. The
questionnaire was not given under Condition D
because of its transparency. The change
indicates that each subject’s subjective 
evaluation of the way he felt during the HIA 
session corresponded to the overt behavioral 
differences observed and to the detection of 
these differences effected by the A scale. Most 
of the scores doubled and some increased by 
as much as 100 out of a possible 160 points. 
Only 1 subject out of 10 failed to report an 
increase in anxiety as reflected in the ques- 
tionnaire. 

It is impossible to estimate the effect of 
the order of the experimental conditions upon 
test performance. The order used in this 
design was chosen so as to minimize contamination
of experimental conditions by
prior experience. Obviously the “normal” 
administration of the MMPI which served 
as a control and also as the basis for select- 
ing subjects had to come first. It was felt 
that the faking conditions should be last in 
order to prevent the establishment of a faking 
set which might persist and intrude upon the 
hypnotically induced anxiety condition. 


SUMMARY AND CONCLUSIONS 


Ten college students took the MMPI 
under three conditions. In the first condition, 
the test was taken as part of a classroom 
demonstration. The second administration 
occurred under conditions of hypnotically induced
anxiety. The third administration
occurred in the waking state after instruc- 
tions were given to “fake” anxiety. 

Comparisons of the test profiles and the 
results of the specially constructed anxiety 
questionnaire permitted the following con- 
clusions to be drawn: 

1. Anxiety suggestions, when culled from 
definitions and descriptions of anxiety from 
independent sources, cause a significant in- 
crease in the Welsh A scale of the MMPI 
when they are given to hypnotized subjects. 

2. Overt behavioral signs indicate that af- 
fective changes are experienced when anxiety 
is suggested to hypnotized subjects. 

3. The anxiety questionnaire revealed that 
the subjects reported a marked increase of 
feelings of tension, discomfort, unpleasant- 
ness, and apprehension following the 
“anxiety” instructions in the hypnotized 
state over their reports in the normal 
waking state. 

4. The validity scales of the MMPI suc- 
cessfully identify 9 out of 10 dissemblers and 
show that, in a state of hypnotically induced 
anxiety, valid profiles are obtained. 

5. Significant differences in the diagnostic 
and validity scales between conditions of 
hypnotically induced anxiety and conditions 
of dissembling indicate that the former is dif- 
ferent enough from the latter to strongly sug- 
gest that hypnosis is not a state of mere 
exaggerated cooperation. 




REFERENCES 


Cofer, C. N., Chance, J., & Judson, A. J. A study
of malingering on the Minnesota Multiphasic Personality
Inventory. J. Psychol., 1949, 27, 491-499.

Conklin, E. Principles of abnormal psychology.
New York: Holt, 1936.

Dorcus, R. M. (Ed.) Hypnosis and its therapeutic
applications. New York: McGraw-Hill, 1956.

Edwards, A. L. Experimental design in psychological
research. New York: Rinehart, 1950.

Edwards, A. L. Techniques of attitude scale construction.
New York: Appleton-Century-Crofts,
1957.

Fisher, V. E., & Marrow, A. J. Experimental study
of moods. Charact. Pers., 1934, 2, 201-208.

Gough, H. G. Simulated patterns on the Minnesota
Multiphasic Personality Inventory. J. abnorm.
soc. Psychol., 1947, 42, 215-225.

Gough, H. G. The F minus K dissimulation index
for the Minnesota Multiphasic Personality Inventory.
J. consult. Psychol., 1950, 14, 408-413.

Grosz, H. J., & Levitt, E. E. The effects of hypnotically
induced anxiety on the Manifest Anxiety
Scale and the Barron ego-strength scale. J. abnorm.
soc. Psychol., 1959, 59, 281-283.

Hathaway, S. R. Scales 5 (masculinity-femininity),
6 (paranoia) and 8 (schizophrenia). In G. S.
Welsh & W. G. Dahlstrom (Eds.), Basic readings
on the MMPI in psychology and medicine. Minneapolis:
Univer. Minnesota Press, 1956.

Heyns, R. W. The psychology of personal adjustment.
New York: Dryden, 1958.

Hunt, H. F. The effect of deliberate deception on
Minnesota Multiphasic Personality Inventory
profiles. J. consult. Psychol., 1948, 12, 396-402.

Huston, P. E., Shakow, D., & Erickson, M. H. A
study of hypnotically induced complexes by means
of the Luria technique. J. gen. Psychol., 1934, 11,
650-697.

Lehner, G. F., & Kube, E. The dynamics of personal
adjustment. New Jersey: Prentice-Hall,
1955.

Luria, A. R. The nature of human conflict. New
York: Liveright, 1932.

McNemar, Q. Psychological statistics. New York:
Wiley, 1949.

May, R. The meaning of anxiety. New York:
Ronald, 1950.

Shaffer, L., & Shoben, E. The psychology of adjustment.
(2nd ed.) Boston: Houghton Mifflin,
1956.

Sweetland, A. Hypnotic neuroses: Hypochondriasis
and depression. J. gen. Psychol., 1948, 39, 91-105.

Warren, H. C. Dictionary of psychology. Boston:
Houghton Mifflin, 1934.

Welsh, G. S. An anxiety index and an internalization
ratio for the MMPI. J. consult. Psychol.,
1952, 16, 65-72.


(Received March 28, 1960) 





Journal of Consulting Psychology
1961, Vol. 25, No. 2, 171-177


THE EFFECT OF BRAIN DAMAGE UPON SPEED, 
ACCURACY, AND IMPROVEMENT IN VISUAL 
MOTOR FUNCTIONING 


KENNETH B. STEIN¹


Veterans Administration Regional Office, San Francisco


The purpose of this study was to investigate
three aspects of visual motor functioning
in relation to cortical deficit. These aspects
were visual motor speed, accuracy in visual
motor reproduction, and improvement. These
features were observed simultaneously within
a single unitary time span and a single unitary
task. The relative importance of each of
these modalities as well as their
interrelationships were also studied.

Klebanoff (1945) and Klebanoff, Singer,
and Wilensky (1954) have extensively reviewed
studies of brain damage. The reviews
reveal that some of these three visual motor
features have been studied either implicitly or
explicitly.

The speed variable is dealt with in timed
tasks. Some of the performance subtests of
the WAIS (Wechsler, 1955) lend themselves
to measures or scores based on time or time
bonus credits. Almost all investigators have
found some of these visual motor subtests,
such as the Digit Symbol (Klebanoff et al.,
1954, p. 13) and the Block Design (Aita,
Armitage, Reitan, & Rabinowitz, 1947; Allen,
1947, 1948; Goldman, Greenblatt, & Coon,
1946; Greenblatt, Goldman, & Coon, 1946;
Lidz, Gay, & Tietze, 1942), impaired by
cortical damage. Another example of
visual motor speed tasks was the cancellation
test techniques used by Hunt (Freeman &
Watts, 1939) and Rylander (1939). They
found reduced speed following lobotomy.
Each of them also employed an accuracy
score. Hunt found that subjects demonstrated
greater accuracy after lobotomy. Rylander
did not find any significant difference in
accuracy scores.

1 The author wishes to thank R. C. Tryon, University
of California, Berkeley, and F. Henderson
and staff members of the Veterans Administration
Clinic, San Francisco, for their helpful suggestions
and critical reading of the manuscript.

As with tests for speed, there are specially 
devised techniques which focus upon the 
qualitative or accuracy aspect of functioning 
by the brain injured. The Bender-Gestalt 
(Bender, 1933) and the St. Louis Memory- 
for-Design (Graham & Kendall, 1946) Tests 
involve the reproduction of geometrical 
figures. In these tests it is often noted that 
subjects with cortical lesions tend to re- 
organize the figures qualitatively in the 
direction of simplification. Fragmentation, 
rotation, reversal, and greater closure and 
balance are some of the manifestations of 
simplification. These perceptual motor dis- 
turbances have been discussed in terms of 
Goldstein’s theory of concrete and abstract 
attitudes (Goldstein & Scheerer, 1941) as 
well as gestalt theory (Bender, 1933). 

One would expect that the concrete atti- 
tude of the organic would be revealed not 
only in its effect upon speed and accuracy 
but also upon improvement. Goldstein and 
Scheerer (1941) indicate that the organic 
does not profit as readily from experience as 
the normal. McFie and Piercy (1952) found 
impairment of retention and learning which 
was related to the size of the lesion. 

The main hypothesis for investigation in the present experiment
was that organics differ significantly from nonorganics in speed,
accuracy, and improvement in visual motor functioning. As an
auxiliary problem an exploration was made of the interrelationships
of these three variables as well as two additional ones, IQ and
age. Finally, each variable as well as the combination of these
variables was assessed for power of discrimination between
organics and nonorganics.


METHOD 
Subjects 

There were 60 subjects with cortical brain 
damage and 120 controls. All were white and 
American born. Of the 120 controls 15 were female 
and 105 male, whereas all the organics were male.²
Forty-two of the experimental subjects were VA 
outpatients and the remaining 18 were hospitalized 
veterans. Diagnostically the organic subjects were 
distributed as follows: 27 posttraumatic encephalop- 
athies, 14 cortical insults associated with circula- 
tory disorders, and 8 preoperative and postoperative 
tumors. The remaining 11 experimental subjects 
had varying diagnoses such as paresis, encephalitis, 
tuberculous meningitis, and cortical atrophy. None 
of them was considered psychotic. Of the 120 con- 
trols, 51 were VA outpatients with nonpsychotic 
psychiatric diagnoses; 29, outpatients in treatment 
for tuberculosis; and 40, general medical outpatients. 
None of the experimental and control subjects had 
any noticeable visual defect or motor impairment 
of their writing hand. 

The 180 subjects were drawn from a larger pool 
of 261 subjects in order that the organics and 
controls could be equated for age, education, and 
IQ. Other clinical groups such as various psychotics 
will be used in a subsequent study and should 
provide a comparison with the results on the present 
groups. 


Procedures 


Vocabulary Subtest. The Vocabulary subtest of 
the Wechsler-Bellevue scale, Form I (Wechsler, 
1944) was used to obtain an estimate of verbal 
IQ. Morrow and Marks (1955) found no significant 
difference between brain injured and control sub- 
jects on vocabulary. This lack of difference led the 
authors to conclude that this measure was an ade- 
quate indicator of premorbid IQ. Both Jackson 
(1955) and Rapaport (1945) report that the 
Vocabulary subtest tends to be relatively refractory 
to impairment. 

Symbol-Gestalt Test. It was expected that the organics with their
concrete attitude would show significantly greater difficulty than
nonorganics with both speed and accuracy as well as with
improvement. As a means of tapping these three aspects of visual
motor functioning, a symbol substitution task was developed. The
format is similar to the Digit Symbol subtest of the WAIS but new
symbols have been devised ³ (see Figure 1). These symbols were
constructed to have poor gestalt form, i.e., they lack closure and
balance. They have gaps and unequal as well as nonparallel lines.
This visual motor task has a 3-minute time limit and the number of
substitutions made for each minute was recorded by the
experimenter. There are a total of 110 substitution items. The
time limit prevents subjects from completing all items so that no
one achieves a maximum score.

² As part of an earlier unpublished study the author found no
significant differences for the sex variable on the Symbol-Gestalt
procedure used in this experiment. The 22 females' mean
score = .687, SD .536; 22 males' mean score = .606, SD .448. The
t = .534 which is not significant. Both groups were equated for
age and IQ.








Fig. 1. Symbol-Gestalt Test: symbols and numbers.


This task yielded one speed, one improvement, and three accuracy
measures which could be statistically compared for the
experimental and control groups. The speed score (3-minute
complete) was the total number of substitutions made in the
3-minute time limit. The improvement measure was calculated by
subtracting the total number of correct symbols in the first
minute from the total number in the third minute. The three
accuracy measures were: (a) the total number of correct
substitutions in 3 minutes (3-minute correct); (b) the percent
error (% error) based on the number of errors in the first 40
substitutions; and (c) the number of qualitative errors (Q error):
rotations, reversals, wrong substitutions, and distortions. The
instruction for administration of the test to the subjects was
similar to that of the Digit Symbol subtest (Wechsler, 1955).
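The five scores just defined can be illustrated with a short Python sketch. It is only a reading of the scoring rules as stated above, not the author's scoring procedure: the `Substitution` record, the `score_protocol` function, and the handling of protocols with fewer than 40 attempted items are all assumptions introduced for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Substitution:
    """One attempted item on the 3-minute task (hypothetical record layout)."""
    minute: int                      # 1, 2, or 3: minute in which the item was attempted
    correct: bool                    # True if the symbol was reproduced correctly
    qualitative_error: bool = False  # rotation, reversal, wrong substitution, or distortion


def score_protocol(items: List[Substitution]) -> Dict[str, float]:
    """Compute the five scores described in the text from an ordered list of attempts."""
    correct_by_minute = {1: 0, 2: 0, 3: 0}
    for item in items:
        if item.correct:
            correct_by_minute[item.minute] += 1

    # Assumption: when fewer than 40 items were attempted, the percentage
    # is taken over the items actually attempted.
    first_block = items[:40]
    errors = sum(1 for item in first_block if not item.correct)
    pct_error = 100.0 * errors / len(first_block) if first_block else 0.0

    return {
        "3-minute complete": len(items),                       # speed
        "3-minute correct": sum(correct_by_minute.values()),   # accuracy (a)
        "% error": pct_error,                                  # accuracy (b)
        "Q error": sum(1 for item in items if item.qualitative_error),  # accuracy (c)
        "improvement": correct_by_minute[3] - correct_by_minute[1],
    }


# Invented protocol: 10, 12, and 14 attempts in minutes 1, 2, and 3.
protocol = (
    [Substitution(1, True)] * 8 + [Substitution(1, False, True)] * 2
    + [Substitution(2, True)] * 11 + [Substitution(2, False)]
    + [Substitution(3, True)] * 12 + [Substitution(3, False)] * 2
)
print(score_protocol(protocol))
```

Because the speed score counts every attempt while the 3-minute correct score counts only correct attempts, the two are computed from largely the same items, a point the Discussion returns to.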


RESULTS 


Equality in Education, Age, and IQ


In the statistical treatment of the measures, t tests were
calculated. Table 1 reveals first that the two groups were equated
for education, age, and IQ. Since these variables may be related
to speed, accuracy, and improvement independently of brain damage,
it was necessary to hold them constant. Thus it was possible to
assess the unique effects of organic injury upon the visual motor
functions.


Differences between Organics and Controls 


The controls demonstrated significantly greater speed than the
organics as shown in the 3-minute complete score. The mean
difference of 16.99 symbols achieved a p below the .001 level
(see Table 1).

³ Although this test was devised by the author, it was first used
by Phelps (1952) in a modified form.


TABLE 1

MEANS, STANDARD DEVIATIONS, RANGES, AND t VALUES FOR
120 CONTROLS AND 60 ORGANICS

Variable              Control M    Mean difference
Education               11.17      [illegible: "2 06"]
Age                     39.91          .51
IQ                     114.39         1.12
3-minute complete       59.02        16.99
3-minute correct        50.08        20.16
% error                 14.95        14.93
Q error                   .63          .99
Improvement              2.16         1.71

 * p at .01 level.
** p < .001.

[The SD, range, organic-group, and t columns are not legible in
the source. Surviving range entries: 5-16, 21-65, 92-139, 7-90.]


Of the three accuracy measures, the 3-minute correct score yielded
the largest t value of 9.19, which has a p also below the .001
level. The controls completed 20.16 more correct symbols than the
brain damaged subjects.

The % error, the second accuracy measure, revealed that the
controls committed half as many errors as the experimental group,
resulting in a t of 5.89 which is significant at less than the
.001 level. The organics had a wider range and greater variability
of % error scores.

In the Q error, the third accuracy measure, the organics produced
almost three times as many such errors as the nonimpaired group.
The t of 2.73 is significant at less than the .01 level. Although
the controls showed a wider range of these errors (0-15) than the
organics (0-13), this was due to one control subject who was the
only one with more than 5 such errors. Over 10% of the organics
made 5-13 qualitative errors.

Improvement, the last measure, showed the organics had improved
very little whereas the controls had a mean improvement of more
than two symbols. The t value was 3.29 which is significant at the
.001 level.


Relationship between Variables 


To ascertain a measure of amount of impairment of the three
aspects of visual motor functioning, point biserial correlations
were calculated between each score and the dichotomous criterion
variable, i.e., organic-nonorganic. In order of size the
correlations for the five scores were as follows: 3-minute correct
.548, 3-minute complete .451, % error .440, improvement .245, and
Q error -.230. In addition age and IQ had near zero correlations,
.018 and .056, respectively.

Table 2 shows the intercorrelations for the seven variables on the
entire sample. Although there were a number of significant
correlations, only a few were large enough to account for a
sizeable amount of the variance. The speed score (3-minute
complete) correlated .895 with one of the accuracy scores,
3-minute correct, but showed low correlations with the remaining
two accuracy scores (with % error -.196, with Q error -.117). The
speed score also had a high relationship of -.549 with age.
Likewise the 3-minute correct accuracy score was highly related to
age (-.585).


TABLE 2

INTERCORRELATIONS FOR TOTAL SAMPLE ON SEVEN VARIABLES

                     3-Minute   3-Minute
Variable             Complete   Correct   % Error   Q Error   Improvement
Age                   -.549      -.585      ...       .182       -.147
IQ                     .248       ...       ...       .068        .085
3-minute complete                 .895     -.196     -.117        .203
3-minute correct                           -.581     -.335        .233
% error                                               .555       -.085
Q error                                                          -.066

Note. Correlations of .193 and .146 are significant at the .01 and
.05 levels, respectively. Entries shown as ... are not legible in
the source; the sign of the Age-Improvement entry follows the
Discussion, which reports that improvement decreases with age.


As might be expected the three accuracy measures were
significantly correlated with each other. The 3-minute correct
with % error yielded an r of -.581, 3-minute correct with Q error
was -.335, and Q error with % error was .555. Thus 3-minute
correct correlated negatively with the other two accuracy scores,
whereas Q error and % error were positively related to each other.

In order to compare the organic and control groups further, a
correlation matrix was obtained for each group on the seven
variables. These correlations are found in Table 3. There were a
few correlations which seem to be divergent for the two groups.
The correlations between age and IQ went in opposite directions
for the two groups. The controls showed an r of -.244 which is
significant at the .01 level. The organics showed an r of .187
which falls short of significance. Yet the difference between
these two correlations transformed to Fisher's z' values yielded a
z equal to 2.284 which is significant at less than the .05 level.
The nonorganics showed a significant relationship between IQ and
3-minute correct (.295) but the organics did not (-.066). The
difference is significant with a z of 2.228.

The organics had a higher correlation between 3-minute correct and
% error (-.616) than the controls (-.374). The z of 2.006 is again
significant for the difference between the two correlations. The
remaining correlational differences are not significant.


TABLE 3

INTERCORRELATIONS FOR CONTROL AND ORGANIC GROUPS ON SEVEN VARIABLES

[Table body not recoverable from the source. Legible entries, in
the order in which they survive: -.620, .101; -.680, .287, .874;
.250, -.193; .152, .048; -.149, .069; -.380, .109; .020; -.017;
-.708, -.066, .849; .576.]

Note. For controls, correlations of .234 and .179 are significant
at the .01 and .05 levels, respectively.


Diagnostic Classificatory Power of the Measures


Table 4 indicates the empirical cutoff point which maximizes
differentiation between organic and nonorganic groups for each
variable. These cutoff points were determined on the present
sample. The 3-minute correct score had the highest discrimination
with 79% correct classification. Next was % error with 76%
followed by 3-minute complete, Q error, and improvement. In each
case the controls displayed a smaller percentage of
misclassification than did the organics.


TABLE 4

THE CLASSIFICATION ACCURACY OF EACH VARIABLE BASED ON
120 CONTROLS AND 60 ORGANICS

Columns: Variable; Cutoff Score; % Overlap; % Correct Classification
Rows: 3-minute complete; 3-minute correct; % error; Q error; Improvement

[Table body not recoverable from the source. Legible entries, not
assignable to a column: 18, 20, 33, 48, 34.]


TABLE 5

THE CLASSIFICATION ACCURACY FOR SIX VARIABLES COMBINED
USING B COEFFICIENT

[Table body not legible in the source.]


The classificatory power of these variables taken singly and in
combination is shown in the comparison of Tables 4 and 5. In a
previous study involving 261 subjects from which the present 180
subjects were drawn, B coefficients were determined for each of
these five variables as well as for the sixth, i.e., age.⁴ These B
coefficients were applied to the subjects in the current study.
Therefore the results are not strictly cross-validative in nature.
The results reflected a much smaller misclassification for both
groups. When the six variables were combined, the controls showed
a misclassification of 10.83% and the organics 13.3%. For the
total sample the incorrect classification was 11.67%.
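The two classification procedures in this section, an empirical cutoff on a single score and a weighted combination of six scores, can be sketched in Python as follows. This is an illustration under stated assumptions rather than the author's computation: the function names and toy scores are invented, the B coefficients are the ones listed in the article's footnote, and the sketch assumes they apply to raw scores, which the source does not say. The cutoff applied to the combined score is not given in the source and is not guessed here.

```python
from typing import Dict, Sequence, Tuple

# B coefficients as listed in the article's footnote.
B = {
    "3-minute correct": 0.00779,
    "3-minute complete": 0.02194,
    "age": 0.02087,
    "% error": -0.02068,
    "Q error": -0.00624,
    "improvement": 0.04118,
}


def percent_correct(controls: Sequence[float], organics: Sequence[float],
                    cutoff: float) -> float:
    """Percent classified correctly when scores >= cutoff are called 'control'."""
    hits = sum(1 for x in controls if x >= cutoff)
    hits += sum(1 for x in organics if x < cutoff)
    return 100.0 * hits / (len(controls) + len(organics))


def best_cutoff(controls: Sequence[float],
                organics: Sequence[float]) -> Tuple[float, float]:
    """Try every observed score as a cutoff; return (cutoff, percent correct)."""
    best = (float("nan"), -1.0)
    for cutoff in sorted(set(controls) | set(organics)):
        pct = percent_correct(controls, organics, cutoff)
        if pct > best[1]:  # keep the lowest cutoff among ties
            best = (cutoff, pct)
    return best


def composite(subject: Dict[str, float]) -> float:
    """Weighted sum of the six scores (assumes the weights apply to raw scores)."""
    return sum(B[name] * subject[name] for name in B)


# Toy scores on a "higher is better" measure, invented for illustration only.
toy_controls = [50, 60, 70, 80]
toy_organics = [20, 30, 40, 55]
print(best_cutoff(toy_controls, toy_organics))

# The reported misclassification rates are consistent with 13 of 120 controls
# and 8 of 60 organics (counts inferred from the percentages, not given in the text).
print(round(100 * 13 / 120, 2), round(100 * 8 / 60, 1), round(100 * 21 / 180, 2))
```

For the error scores, where a higher value points toward the organic group, the same search can be run on negated scores. A cutoff chosen on the same sample it classifies will tend to look better than it would on new cases, which is the reservation the text raises when it calls the combined results not strictly cross-validative.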
⁴ An unpublished preliminary study by the author. B coefficients
were as follows: 3-minute correct .00779, 3-minute complete
.02194, age .02087, % error -.02068, Q error -.00624, and
improvement .04118.


DISCUSSION


The results indicate that the organics are significantly impaired
in speed, accuracy, and improvement in visual motor functioning.
Since the r_pb of the 3-minute correct is of greater magnitude
than that of the other scores, it suggests that the organics
suffer most in the accuracy function. Further investigation raises
some question about such a conclusion. Table 2 discloses an
extremely high correlation of the 3-minute correct accuracy
measure with the speed score indicating that the 3-minute correct
is not strictly an independent accuracy measure. Since this score
is also highly correlated with the other two accuracy measures, it
suggests that the 3-minute correct taps a combination of the speed
and accuracy functions. Both the Q error and % error with their
low correlations with speed appear to be fairly independent of
speed. In view of these findings, a reconsideration of the
hierarchy of r_pb's suggests that the speed function is most
impaired, closely followed by one of the accuracy measures, i.e.,
% error, and then by improvement.

Since the 3-minute correct score seems to be a combination of both
speed and accuracy as well as having the largest r_pb, it suggests
that a combination of variables or functions is more likely to
reveal a greater difference between groups than any single
variable. Evidence for this inference is pointed up in a
comparison of Tables 4 and 5. Not only is the percent correct
classification greater for the combined variables, but also the
organic and control groups show less divergence since they are
within 2.5% of each other.

An additional reason for the higher discriminatory power of the
combined scores is the inclusion of the age variable. Since the
two groups were equated for age, this variable had no effect by
itself in separating organics from nonorganics. Yet age did have a
definite influence within groups upon visual motor performance
since it was significantly correlated with the three main
variables. These correlations reveal that as age increases, speed,
improvement, and accuracy decrease. These findings are similar to
Wechsler's results (1944, 1955). The function that age serves in
combination with the other scores is that of a suppressor
variable.

The IQ variable shows a low but significant correlation with speed
but no relationship to accuracy and improvement. These findings
are at variance with Wechsler (1955) who revealed a correlation of
.64 between Vocabulary IQ and the Digit Symbol subtest. This
difference can be explained partly by the absence of the lower
quartile from the IQ range in the present study. Another factor
which may contribute to the lower correlation is that organics
with negligible correlations are more heavily weighted in the
present sample than would be expected in the general population
with which Wechsler dealt.

The results in Table 3 show three sets of correlations to be
significantly different for the two groups. On age with IQ the two
groups went in opposite directions. The explanation may be that
merely by chance this organic group was more heavily weighted for
the higher IQs in the older age range than the controls. Likewise
by chance the controls were more heavily weighted for lower IQs in
the younger age range. The correlation of IQ and 3-minute correct
shows the controls tending in the direction of Wechsler's result
(1955) of a definite influence of IQ upon the Digit Symbol subtest
score. With the impairment in visual motor functioning in the
organic, IQ seemingly plays very little part in affecting the
magnitude of his score. The third set of correlations that yielded
a significant difference between the two groups was 3-minute
correct and % error. The factor which may account for the
difference is that the organic group made a significantly greater
number of errors thus lowering their 3-minute correct scores.
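The comparison of a correlation across the two groups, which the Results describe as transforming each r to Fisher's z' and testing the difference, can be sketched as below. The article does not spell out its computation, so this uses the standard textbook form with a standard error of sqrt(1/(n1 - 3) + 1/(n2 - 3)); the function name is hypothetical, and published values need not be reproduced exactly.

```python
import math


def fisher_z_difference(r1: float, n1: int, r2: float, n2: int) -> float:
    """z statistic for the difference between two independent correlations."""
    z1 = math.atanh(r1)  # Fisher's z' transformation of the first correlation
    z2 = math.atanh(r2)  # and of the second
    standard_error = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    return (z1 - z2) / standard_error


# 3-minute correct with % error: organics (n = 60) versus controls (n = 120).
z = fisher_z_difference(-0.616, 60, -0.374, 120)
print(round(abs(z), 2))
```

For this pair the sketch gives an absolute z of roughly 2.0, in line with the 2.006 reported; a value above 1.96 is significant at the .05 level on a two-tailed test.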

If we exclude the 3-minute correct score, which was found to be a
combination of speed and accuracy, Table 2 presents the striking
finding that the intercorrelations are low for the measures
tapping the three functions of speed, accuracy, and improvement.
This suggests that the three functions are relatively independent.
The conclusion of independence, however, has to be made with
certain reservations. Since all the scores are derived from a
single measuring instrument administered on one occasion, they are
based to some extent upon responding to the same items. The speed
score is based on all the correct and incorrect symbol
substitutions. Therefore, error scores involve an overlap of
certain items contained in the speed score. Similarly, the
improvement score has components of both the speed and accuracy
measures. In order to test further for the independence of these
three functions, a design in which measures are derived from
experimentally different items will be required.

It remains for future studies to determine 
how the current results compare with samples 
involving other clinical groups such as schizo- 
phrenics and psychotic depressions. Confu- 
sion, motor retardation, and agitation in these 
groups may have disturbing effects upon the 
speed, accuracy, and improvement functions. 


SUMMARY 


Speed, accuracy, and improvement in vis- 
ual motor functioning were investigated in 60 
organic and 120 nonorganic subjects. A 3- 
minute substitution task was employed to 
obtain the data. This task yielded one speed, 
one improvement, and two strictly accuracy 
measures. 

The results indicated that speed, accuracy, 
and improvement in visual motor perform- 
ance were all significantly impaired in the 
brain injured group. The relatively low inter- 
correlations found suggested that the three 
variables may be fairly independent and spe- 
cific factors contributing to the more general 
visual motor function or process. A fourth 
variable, age, appeared to be less independent 
since it correlated significantly with the first 
three variables. The fifth variable, IQ, unlike 
the first four, showed no noticeable influence 
upon visual motor performance. The implica- 
tions of these findings were discussed. 

The discriminant power of each of the 
scores was studied in relation to the number 
of correct classifications of organic and non- 
organic subjects. These findings were then 
compared with the discriminant power of the 
combination of all of the scores. The result 
was that the combined scores yielded a higher 
percentage of correct classifications than the 
individual measures. 

REFERENCES

Aita, J. A., Armitage, S. G., Reitan, R. M., & Rabinowitz, A. The
use of psychological tests in the evaluation of brain injury.
J. gen. Psychol., 1947, 37, 25-44.

Allen, R. M. The test performance of the brain injured. J. clin.
Psychol., 1947, 3, 225-230.

Allen, R. M. The test performance of the brain diseased. J. clin.
Psychol., 1948, 4, 281-284.

Bender, Lauretta. Disturbances in visuomotor gestalt function in
organic brain disease associated with sensory aphasia. Arch.
Neurol. Psychiat., 1933, 30, 514-537.

Freeman, W., & Watts, J. W. An interpretation of functions of the
frontal lobe based upon observations in forty-eight cases of
prefrontal lobotomy. Yale J. Biol. Med., 1939, 11, 527-539.

Goldman, R., Greenblatt, M., & Coon, G. P. Use of the
Bellevue-Wechsler scale in clinical psychiatry with particular
reference to cases with brain damage. J. nerv. ment. Dis., 1946,
104, 144-179.

Goldstein, K., & Scheerer, M. Abstract and concrete behavior: An
experimental study with special tests. Psychol. Monogr., 1941,
53(2, Whole No. 239).

Graham, Frances K., & Kendall, Barbara S. Performance of brain
damaged cases on a Memory-For-Designs Test. J. abnorm. soc.
Psychol., 1946, 41, 303-314.

Greenblatt, M., Goldman, R., & Coon, G. P. Clinical implications
of the Bellevue-Wechsler test (with particular reference to brain
damage cases). J. nerv. ment. Dis., 1946, 104, 438-442.

Jackson, C. V. Estimating impairment on Wechsler Bellevue
subtests. J. clin. Psychol., 1955, 11, 137-143.

Klebanoff, S. G. Psychological changes in organic brain lesions
and ablations. Psychol. Bull., 1945, 42, 585-623.

Klebanoff, S. G., Singer, J. L., & Wilensky, H. Psychological
consequences of brain lesions and ablations. Psychol. Bull., 1954,
51, 1-41.

Lidz, T., Gay, J. R., & Tietze, C. Intelligence in cerebral
deficit states and schizophrenia measured by Kohs Block Test.
Arch. Neurol. Psychiat., 1942, 48, 568-582.

McFie, J., & Piercy, M. F. Intellectual impairment with localized
cerebral lesions. Brain, 1952, 75, 292-311.

Morrow, R. S., & Marks, J. C. The correlation of intelligence and
neurologic findings of twenty-two patients autopsied for brain
damage. J. consult. Psychol., 1955, 19, 283-289.

Phelps, C. K. An analysis of the integrative aspects of the
performance of normal and brain damaged subjects on specially
devised motor tests. Unpublished doctoral dissertation, Kansas
University, 1952.

Rapaport, D. Diagnostic psychological testing. Vol. I. Chicago:
Year Book, 1945.

Rylander, G. Personality changes after operation on the frontal
lobes: A clinical study of thirty-two cases. Copenhagen: Ejnar
Munksgard, 1939.

Wechsler, D. Measurement of adult intelligence. Baltimore:
Williams & Wilkins, 1944.

Wechsler, D. Wechsler Adult Intelligence Scale. New York:
Psychological Corp., 1955.

(Received March 28, 1960) 





Journal of Consulting Psychology 
1961, Vol. 25, No. 2, 178-179 


THE INFLUENCE OF CONTEXT ON THE DEPRESSION 
SCALE OF THE MMPI IN A PSYCHOTIC POPULATION 


GORDON W. OLSON 
Anoka State Hospital, Minnesota 


This report is felt to be of current interest 
and value in view of the recent marketing of 
many antidepressant drugs, an influx which 
has created the need to objectively assess 
their effectiveness. Depression is often an es- 
pecially difficult symptom to assess clinically 
because of the variety of manifestations and 
because it is frequently masked by or mixed 
with more dramatic symptoms such as schizo- 
phrenic apathy and hypochondriacal com- 
plaints. The present study was the result of 
a search for a short, reasonably objective, 
and easily procured measure of depression. A 
striking possibility appeared to be the de- 
pression scale (D) of the MMPI, the pur- 
pose of which was to identify the “state of 
mind characterized by poor morale, lack of 
hope in the future, and dissatisfaction with 
one’s own status generally” (Hathaway &
McKinley, 1942). The question posed was 
whether the D scale by itself would measure 
the same thing as when the entire inventory 
is administered, a question that is not an- 
swered by previous studies of reliability of 
the D scale. 

Canter (1960) mentions the often-heard 
criticism that different response sets may be 
elicited if items or single scales are isolated 
from a main body of items, but found in his 
study that the D scale appearing out of con- 
text (but in combination with Pt and K) did 
differentiate among suicidal, nonsuicidal psy- 
chiatric, and nonhospitalized groups. This at- 
tests to the validity of the D scale and sug- 
gests that context played a not-too-important 
role in that instance. 


METHOD 


The 60 items comprising the D scale of the MMPI were mimeographed
as a separate [form] and columns provided to check the items as
true or false. Fifty psychiatric inpatients at Anoka State
Hospital were administered consecutively the entire MMPI and the
60-item D scale. The sample contained 30 females and 20 males.
Subjects (Ss) did not know beforehand that a second procedure
would follow the first. One-half of the group took the entire MMPI
first and the other half took the D scale only first. Ss were not
selected with the exception that many had been referred for
psychological examination and others were seen for routine testing
on admission. Statistical analysis involved a t test of the
difference for related means of the raw scores under the two
conditions and the Pearsonian correlation coefficient for the 50
pairs of scores.
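The two statistics named in the Method, a t test for related means and the Pearson correlation over paired scores, can be sketched in plain Python as follows. The function names and the five pairs of scores are invented for illustration; they are not the study's data.

```python
import math
from statistics import mean, stdev
from typing import Sequence


def paired_t(x: Sequence[float], y: Sequence[float]) -> float:
    """t for the difference between related means (df = number of pairs - 1)."""
    differences = [a - b for a, b in zip(x, y)]
    standard_error = stdev(differences) / math.sqrt(len(differences))
    return mean(differences) / standard_error


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson product-moment correlation for paired scores."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical raw D scores for five subjects under the two conditions.
d_alone = [20, 25, 18, 30, 22]
d_in_full_mmpi = [21, 24, 18, 31, 22]
print(round(paired_t(d_alone, d_in_full_mmpi), 3))
print(round(pearson_r(d_alone, d_in_full_mmpi), 3))
```

The two statistics answer different questions: the t test asks whether the average score shifts between conditions, while r asks whether subjects keep their relative standing, so both are needed to argue that the scale behaves the same in and out of context.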


RESULTS AND DISCUSSION 


The mean raw score when the D scale was 
administered alone was 22.1 and it was 22.1 
when a part of the entire test. The standard 
deviations were 6.1 and 6.7, respectively. The 
t for this difference was .02, nonsignificant.
The r for all pairs of scores was .99 ± .14.
The largest difference found in this sample 
was six points and the median difference was 
zero. 

While the results of this study do not bear 
on the validity of the D scale as a measure of 
depression, they do clearly indicate that what- 
ever is measured by the D scale of the MMPI 
can be measured in a psychiatric population 
without administration of the entire inventory. 

SUMMARY


The recent influx of antidepressant drugs stimulated the search
for a short, objective, and easily procured measure of depression.
The D scale of the MMPI would seem to fulfill these criteria if it
could be administered separately, but the question presented was
the familiar one of effect of context on response set. Fifty
psychiatric inpatients were individually administered the entire
MMPI and the D scale only; one-half took the MMPI first and
one-half the D scale first. The correlation of D scores in the two
situations was .99 and the t nonsignificant. It was concluded that
context did not influence the response set and that whatever is
measured by the D scale can be measured without administration of
the entire inventory.


REFERENCES

Canter, A. The efficacy of a short form of the MMPI to evaluate
depression and morale loss. J. consult. Psychol., 1960, 24, 14-17.

Hathaway, S. R., & McKinley, J. C. A multiphasic personality
schedule (Minnesota): III. The measurement of symptomatic
depression. J. Psychol., 1942, 10, 249-254.


(Received February 26, 196 








Journal of Consulting Psychology
1961, Vol. 25, No. 2, 180-181


SUPPRESSING DISTORTION IN TEMPERAMENT
INVENTORIES


JACK HAND AND HERBERT H. REYNOLDS


Baylor University 


This study was designed to determine the effects of “appraisal”
and “research” testing conditions upon six scales taken from the
Guilford-Zimmerman Temperament Survey (GZTS).

Subjects consisted of 373 USAF basic trainees divided into two
groups. In the first week of duty both groups completed a
self-report temperament inventory composed of 240 items from the
GZTS. Instructions for the Research Group were:


I am studying personality tests and need your 
help. This work has no connection with the Air 
Force. Please do not put any identifying marks on 
your answer sheets. 


This was in addition to the regular instruc- 
tions. These instructions were administered 
by the senior author in civilian clothes. The 
Appraisal Group was told: 


As you know, personality characteristics are re- 
lated to success in the Air Force. The tests you are 
about to take will be a matter of record.


These instructions were administered by the 
co-author in full uniform (Air Force Cap- 
tain). 

Included in the inventory are the following scales:


DG: 8 socially desirable items from the GZTS G scale ¹

UG: 8 socially undesirable items from the GZTS G scale

DR: 10 socially desirable items from the GZTS R scale

UR: 10 socially undesirable items from the GZTS R scale

SD: a scale designed to measure social desirability (SD) ²

¹ SD values for all items in the GZTS (except MF) were established
by another investigator (Kelley, 1959).

² A description of this scale is being prepared for publication.
The correlation between it and Edwards' SDS (Edwards, 1957) is .54.


TABLE 1

COMPARISON OF APPRAISAL AND RESEARCH GROUPS
ON SIX CLUSTERS OF ITEMS FROM THE GZTS

              Appraisal         Research
              (N = 190)         (N = 183)
                                                t ratio
Variable       M      SD         M      SD     (M1 - M2)
DG            5.25   1.79       4.84   1.80      2.20*
DR            6.28   2.16       5.67   2.26      2.66**
UG            3.52   1.78       3.95   1.69      2.39*
UR            3.56   1.85       4.43   1.98      4.39**
DG + UG       8.77   2.89       8.79   2.76       .07
DR + UR       9.84   3.09      10.10   3.48       .76

 * p < .05.
** p < .01.


Each subject received seven scores: DG, UG, DG + UG, DR, UR,
DR + UR, and SD. The DG + UG and DR + UR are scores from scales
composed of an equal number of desirable and undesirable items.


RESULTS AND DISCUSSION 


Table 1 gives comparisons of group means on the temperament
variables. These results indicate rather clearly the effects of
different instructions and conditions upon the temperament scores.
When defensiveness is stimulated the scores are increased. The
insignificant group differences on variables DG + UG and DR + UR
indicate, however, that a balanced design eliminates the effects
of defensiveness (provided appraisal conditions stimulate
defensiveness).
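The t ratios in Table 1 can be approximately recovered from the tabled means, standard deviations, and group sizes. The Python sketch below does this with an unpooled standard error; that choice and the function name are assumptions, since the report does not say how its t ratios were computed, and with groups this similar in size and spread a pooled estimate gives nearly the same figures.

```python
import math


def t_from_summary(m1: float, sd1: float, n1: int,
                   m2: float, sd2: float, n2: int) -> float:
    """t ratio for two independent means, using an unpooled standard error."""
    standard_error = math.sqrt(sd1 ** 2 / n1 + sd2 ** 2 / n2)
    return (m1 - m2) / standard_error


# DR row: Appraisal (M = 6.28, SD = 2.16, N = 190) vs. Research (M = 5.67, SD = 2.26, N = 183).
t_dr = t_from_summary(6.28, 2.16, 190, 5.67, 2.26, 183)
# UG row: Appraisal (M = 3.52, SD = 1.78) vs. Research (M = 3.95, SD = 1.69).
t_ug = t_from_summary(3.52, 1.78, 190, 3.95, 1.69, 183)
print(round(t_dr, 2), round(abs(t_ug), 2))
```

For the DR and UG rows the sketch gives about 2.66 and 2.39 in absolute value, matching the tabled ratios. Because the inputs are rounded to two decimals, other rows may differ from the table in the second decimal place.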

The product-moment correlations (for ap- 
praisal group) of SD with DG, DR, UG, 
UR, DG + UG, and DR + UR are .32, .49, 
—.34, —.46, .00, and .03 (first four signifi- 
cant at .01 level) further suggesting that the 
balanced design eliminates the influence of
SD upon temperament scores obtained under
appraisal conditions.

Probably the most important interpretation
of this data is that with a balanced
design one set of norms may be appropriate
for the two most widely applicable testing
conditions.


REFERENCES

EDWARDS, A. L. The social desirability variable in
personality assessment and research. New York:
Dryden, 1957.

KELLEY, F. G. Construction of a scale for the measurement
of social desirability. Unpublished
master's thesis, Baylor University, 1959.

(Received March 18, 1960)














Journal of Consulting Psychology 
1961, Vol. 25, No. 2, 182 


BRIEF REPORTS 


SEDATIVES AND SUGGESTIBILITY IN NEUROTIC PATIENTS 1


J. G. INGHAM 2
Llandough Hospital, Glamorgan, Wales


AND J. M. WHITE
Stanley Royd Hospital, Wakefield, Yorkshire, England


Ingham (1955) found that neurotic patients 
taking sedatives were significantly more sug- 
gestible than unsedated neurotics. Suggestibility 
might have been increased by sedation in these 
patients, or alternatively the more suggestible pa- 
tients might, though not necessarily intentionally, 
have been selected for sedative treatment. Two 
investigations were done to find out which of 
these interpretations was more likely. 

In the first investigation, 10 male neurotic pa- 
tients were given the same medication (6 grains 
of sodium amytal per day, in divided doses) for 
3 days following admission to hospital. They were 
then divided at random into two groups and took 
part in a 2-day experiment. One group was tested 
(a) after a further day on the same regime and 
again (b) after one day without drugs. For the 
other group, b was followed by a. The arm-movement
test of suggestibility was used, as well as a
test of static ataxia and arm-movement without
suggestion.

This preliminary experiment offered no sup- 
port for the idea that moderate sedation increases 
suggestibility but its value was limited by the 
small number of subjects and by the fact that 
the period without drugs was so short. 


1 An extended report of this study may be ob- 
tained without charge from J. G. Ingham (Medical 
Research Council Social Psychiatry Research Unit, 
Llandough Hospital; Penarth, Glamorgan, Wales; 
United Kingdom) or for a fee from the American 
Documentation Institute. Order Document No. 6543 
from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress; Washing- 
ton 25, D. C., remitting in advance $1.25 for micro- 
film or $1.25 for photocopies. Make checks payable 
to: Chief, Photoduplication Service, Library of Con- 
gress. 

2 Previously in the Medical Research Council Neu- 
ropsychiatric Research Unit, Whitchurch Hospital, 
Cardiff, Wales. 


In a second investigation, an attempt was made
to eliminate the difference in suggestibility between
sedated and unsedated patients by retesting
them after giving the same amount of sedatives
to both groups for a few days. If the difference
is a result of sedation, then it should be
possible to eliminate it by such a procedure. If,
on the other hand, the difference results from
selection, then it should remain, even when the
groups are equally sedated.

Forty-two male neurotic patients (15 of them
sedated before admission) were tested on admission
to hospital and again days later. During
the intervening period all patients received 6
grains of sodium amytal per day, in divided doses.
It was again found that, tested on admission,
previously sedated patients showed significantly
greater arm-movement suggestibility than unsedated
patients, though both were significantly
more suggestible than a group of 27 normal males.
On retest, after 3 days of medication, the significant
difference in suggestibility between previously
sedated and unsedated patients remained.
There was no indication of an increase in suggestibility,
following medication, in either group.
The findings support the hypothesis that there
was a selection factor operating, whereby the
more suggestible patients were more likely to be
receiving sedatives.

Additional results from both investigations sug- 
gest that static ataxia increases following moder- 
ate sedation. 


REFERENCE

INGHAM, J. G. Psychoneurosis and suggestibility. J.
abnorm. soc. Psychol., 1955, 51, 600-603.


(Received May 27, 1960) 













Journal of Consulting Psychology 
1961, Vol. 25, No. 2, 18 


SEMANTIC DIFFERENTIAL RATING OF SELF AND OF
SELF-REPORTED PERSONAL CHARACTERISTICS 1


JAMES E. MADDEN
Veterans Administration Hospital, Chillicothe, Ohio


The semantic differential has been used as a
measure of similarity of affective reactions to
various concepts. When two or more concepts are
rated similarly (have close profile agreement) the
fact can be interpreted in clinical psychological
terms. For example, equating the “self” concept
and “descriptive” concepts suggests that the
descriptive concepts are important aspects of the
individual’s sense of self. The present study
reveals the extent to which this example interpretation
may be justifiable. The semantic differential
profiles of “I, myself” and independently
indicated aspects of self (self-reported personal
characteristics) are compared, to see if in fact
self and aspects are rated similarly. In addition
to “close” profile agreement the study deals with
other degrees of agreement to show the relation
between profile agreement and the probability
that a descriptive concept is an aspect of self.

Fifty-nine concepts (personal characteristics)
were derived from Mf scale items of the MMPI.
For the rating task, the items were reworded and
presented in the third person singular (i.e., “A
person who . . .”). All items were rated by each
S. However, only the concepts derived from items
marked True by S during an independent
administration of the items in their usual self-report
form are considered to represent aspects of his
sense of self. When S marks an item True, he
virtually implies “I am a person who . . .” Also
rated was the concept “I, myself.”


1 An extended report of this study may be obtained
without charge from James E. Madden (Clinical
Psychology Service, Veterans Administration Hospital;
Chillicothe, Ohio) or for a fee from the American
Documentation Institute. Order Document No.
6542 from ADI Auxiliary Publications Project,
Photoduplication Service, Library of Congress;
Washington 25, D. C., remitting in advance $1.25 for
microfilm or $1.25 for photocopies. Make checks
payable to: Chief, Photoduplication Service, Library
of Congress.

The data reported here are from the writer’s master’s
thesis completed at the University of Kentucky
in 1956.


A set of 15 bipolar seven-step scales (five
evaluative, five potency, and five activity) was
used to rate each concept. The square root of the
sum of squared distances was employed as the
measure of profile agreement or Distance between
“I, myself” and each of the other concepts.
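
The Distance measure described above is the Euclidean distance between two rating profiles. A minimal sketch follows, assuming each concept's ratings are stored as a list of 15 integers from 1 to 7; the function name and the example ratings are hypothetical and are not data from the study.

```python
import math

def profile_distance(profile_a, profile_b):
    """Square root of the sum of squared scale-by-scale differences."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must be rated on the same scales")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(profile_a, profile_b)))

# Hypothetical ratings on 15 seven-step scales
# (five evaluative, five potency, five activity).
i_myself = [6, 5, 6, 7, 5, 4, 4, 5, 3, 4, 5, 6, 4, 5, 5]
concept = [6, 4, 6, 6, 5, 4, 3, 5, 3, 4, 5, 5, 4, 5, 4]

print(profile_distance(i_myself, concept))
```

A Distance of zero means the two profiles are identical on every scale; larger values mean less profile agreement.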

Thirty college students were Ss. The Mf items,
in their usual True-False form, were administered
to half the Ss approximately a week before they
rated the concepts on the semantic differential.
The remaining half had the reverse sequence of
tasks.

The range of Distances for each S was divided
into tenths. In each S’s data the percentage of 
True items (those which he had independently 
marked True) was computed in each of the 10 
regions. Data were adjusted so that a 50% value 
for a region meant that True items had no greater 
or less tendency than False items did to occur in 
that region. The percentages of True items in the 
regions were then averaged across Ss. 
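
The region tabulation for a single S can be sketched as follows. This is only an illustration: it assumes the tenths are equal-width intervals of that S's range of Distances, the helper name and sample data are hypothetical, and the adjustment that makes 50% mean "no tendency" is not specified in this report and is not reproduced here.

```python
def percent_true_by_region(distances, marked_true, n_regions=10):
    """Split one S's range of Distances into equal-width regions and return
    the percentage of True items in each region (None for an empty region)."""
    lo, hi = min(distances), max(distances)
    width = (hi - lo) / n_regions
    counts = [[0, 0] for _ in range(n_regions)]  # [true items, all items]
    for d, is_true in zip(distances, marked_true):
        # The largest Distance falls in the last region; guard against zero width.
        idx = min(int((d - lo) / width), n_regions - 1) if width > 0 else 0
        counts[idx][0] += int(is_true)
        counts[idx][1] += 1
    return [100.0 * t / n if n else None for t, n in counts]

# Hypothetical Distances and True/False marks for six concepts.
distances = [1.0, 1.5, 3.0, 5.5, 8.0, 11.0]
marked_true = [True, True, False, True, False, False]
print(percent_true_by_region(distances, marked_true, n_regions=5))
```

Averaging the per-region percentages across Ss, as the report describes, would then be a matter of taking the mean of each region's value over all Ss with items in that region.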

For all 30 Ss the resulting percentage quantities,
from Region 1 (closest to “I, myself”) through
Region 10 (most distant from “I, myself”), were
as follows: 66.8, 65.1, 56.8, 51.6, 45.7, 40.4, 33.5,
23.9, 21.2, 20.6. The mean percentage of True
items in the first five regions is significantly
larger than the mean percentage in the second
five regions, beyond the .01 level. Results were
similar for sex and sequence groups of Ss. The
probability that a descriptive concept is an aspect
of self is seen to be directly related to the
amount of agreement between the ratings of self
and the concept. From the data’s negative facets
(e.g., some False items in close regions and some
True items in distant regions), some challenging
theoretical possibilities can be gleaned.


(Received June 13, 1960) 











Journal of Consulting Psychology 
1961, Vol. 25, No. 2, 184 


FIELD DEPENDENCE, MANIFEST ANXIETY, AND
SOCIOMETRIC STATUS IN CHILDREN 1


IRA ISCOE AND JOYCE ANN CARDEN


University of Texas 


The Children’s Manifest Anxiety Scale (CMAS) 
as a measure of anxiety or drive level, a rating 
method of sociometric choice (rating own sex) 
employing three choice criteria as an index of so- 
cial status, and Witkin’s Embedded Figures Test 
as a measure of field dependence-independence 
were each administered to an entire sixth grade 
class composed of 16 boys and 15 girls. Mean 
age of the girls was 11 years 7 months, for the 
boys 11 years 11 months. Mean IQ was 118 for 
girls, 116 for boys. They were homogeneous with 
respect to religion and socioeconomic class. They 
had been acquainted for at least 7 months prior 
to the study. 

A significant rank-order correlation of .57 (p 
< .05) was obtained between sociometric status 
and field dependence in girls. For boys the re- 
sults were all in the opposite direction, with the 
choice of “class officer” correlating —.51 (p< 
.05) with field dependence. The data suggest that 
popular boys are more likely to exhibit an active 
field analytic orientation and girls a passive field 
dependent one. CMAS scores were not related to 
sociometric status for boys while significant nega- 
tive correlations (p< .05) on all criteria ques- 
tions were obtained for girls. In addition, the 
number of rejections received was significantly 
related to anxiety level, the r being .65 (p < .01). 
It would appear that more frequently chosen 
girls tend to have a lower drive level (anxiety) 
while under chosen girls have a higher level. The 
more rejected a girl is by her peers, the higher 


1An extended report of this study may be ob- 
tained without charge from Ira Iscoe (Department of 
Psychology, University of Texas; Austin 12, Texas) 
or for a fee from the American Documentation Insti- 
tute. Order Document No. 6541 from ADI Auxiliary 
Publications Project, Photoduplication Service, Li- 
brary of Congress; Washington 25, D. C., remitting 
in advance $1.75 for microfilm or $2.50 for photo- 
copies. Make checks payable to: Chief, Photodupli- 
cation Service, Library of Congress. 





her drive level. For boys, the correlation between 
drive level and field dependency was positive but 
not significant. For girls an r of — .60 was ob- 
tained (p< .01). This would indicate that field 
independent girls tend to be more anxious than 
field dependent ones. Level of intelligence was 
not significantly related to any of the socio- 
metric choices, nor to scores on the CMAS. Some 
support for a significant negative relationship be- 
tween field dependence and intelligence was ob- 
tained for girls but not for boys. 
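
A rank-order correlation of the kind reported above can be computed as the product-moment correlation of the two sets of ranks. The sketch below is a generic Spearman calculation with tied values given average ranks; the report does not state which formula or tie treatment was used, and the variable names and scores are hypothetical.

```python
import math

def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def rank_order_correlation(x, y):
    """Spearman's rho: the product-moment correlation of the two rank lists."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical scores for five children (not data from the study).
sociometric_status = [12, 7, 9, 15, 4]
embedded_figures_time = [30, 55, 41, 62, 48]
print(round(rank_order_correlation(sociometric_status, embedded_figures_time), 2))
```

With classes of 15 or 16 children, as here, a coefficient needs to be fairly large before it reaches the .05 level, which is consistent with the reported values of .57 and -.51 being significant only at that level.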

The results offer some support to the findings 
of Witkin and his associates, in regard to the field 
dependence-independence dimension and person- 
ality characteristics. In the present study, at the 
age and cultural level employed, the girl who is 
an active initiator and organizer is not likely to 
enjoy high social status with her peers. In con- 
trast, the relatively field independent boy is most 
likely to gain wider acceptance by his classmates. 

The descriptions Witkin uses to characterize 
field independent vs. dependent persons might 
well represent the kinds of behavior our middle 
class culture fosters and rewards at these ages. 
Boys are expected to be somewhat aggressive, di- 
rect, and analytic, while girls are taught a more 
submissive, conforming, “ladylike” type of be- 
havior. The girl who identifies with this role gains 
acceptance and is subjectively aware of fewer 
discomforts as picked up by the CMAS. For 
girls, an analytic (field independent) mode of per- 
ceiving results in less popularity and more anx- 
iety. The reversal of the relationship between 
field dependence and social status in boys per- 
haps emphasizes the cultural rewards for their 
exhibiting initiative. The fact that the types of 
behavior examined in the current study have some 
correlates to modes of perceiving points again 
to the intricate relationships that exist between 
perception and personality.

(Received August 1, 1960)









